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Introduction 


This  is  the  second  of  the  two  units  in  which  we  study  patterns  of  events.  In  Unit  2 
we  concentrated  on  patterns  of  events  in  time;  in  this  unit  we  shall  concentrate  on 
patterns  in  space.  The  main  way  in  which  these  differ  from  patterns  of  events  in 
time  is  that  they  usually  occur  in  more  than  one  dimension.  Those  in  this  unit  are 
mainly  two-dimensional  patterns  which  are  spread  over  an  area  of  land:  for 
example,  animals  grazing  in  a  field,  oak  trees  in  a  wood,  or  shops  in  a  town. 
Three-dimensional  patterns,  e.g.  dust  particles  in  a  room,  stars  in  a  galaxy  or 
galaxies  in  the  universe,  are  also  of  interest;  but  these  are  more  complicated  and 
will  not  be  considered  in  this  course.  One-dimensional  spatial  patterns,  e.g.  cars  on 
a  road  at  some  instant  of  time,  or  prime  numbers  on  the  real  line,  occur  less 
frequently  and  these  can  be  studied  by  the  methods  of  Unit  2. 

In  Section  1,  several  examples  of  spatial  patterns  are  described  and  illustrated,  and 
some  of  the  differences  between  them  are  pointed  out.  The  two-dimensional  forms 
of  the  two  basic  patterns  in  one  dimension— the  Bernoulli  process  and  the  Poisson 
process  are  introduced  in  Section  2,  and  examples  of  them  are  simulated.  In 
Section  3  we  shall  look  at  the  two  main  ways  in  which  patterns  can  differ  from  the 
basic  patterns  of  Section  2:  either  there  is  some  attraction  between  objects,  for 
example  grazing  animals  might  group  together  in  families,  and  this  produces  a 
pattern  which  shows  clustering;  or  objects  may  for  some  reason  not  be  found  very 
close  together— for  example,  each  animal  may  have  its  own  exclusive  grazing 
territory— and  this  produces  some  regularity  in  the  pattern.  Probability  models 
are  introduced  for  these  two  situations,  and  patterns  are  simulated.  Section  4  is 
more  mathematical  than  the  rest  of  the  unit;  the  index  of  dispersion  is  calculated 
for  several  models  of  spatial  patterns,  and  we  shall  consider  the  variability  in  the 
distance  between  neighbouring  objects  in  a  spatial  pattern.  Section  5  makes  an 
exception  to  the  usual  policy  of  this  course,  describing  two  ways  in  which  we  can 
decide  on  statistical  grounds  whether  or  not  a  given  pattern  follows  a  Poisson 
process. 

Section  1  is  introductory,  very  short  and  contains  no  questions.  Sections  2  and  3 
are  mainly  descriptive  but  contain  numerical  simulations.  Section  4  will  require 
most  study  time.  If  you  have  studied  M345,  you  will  find  Section  5  very 
straightforward,  though  you  only  need  to  know  basic  ideas  about  hypothesis  tests 
(such  as  contained  in  M245 )  for  this  section. 

There  are  no  audio-cassette  sessions  or  video-cassette  bands  associated  with  this 
unit. 


1  What  are  spatial  patterns? 


We  begin  by  looking  at  some  illustrative  examples  of  spatial  patterns. 

Example  1.1  Free-flowing  traffic 

Figure  1.1(a)  shows  the  positions  of  vehicles  on  the  eastbound  carriageway  of  the 
Great  West  Road  at  Heston.  This  diagram  was  derived  from  part  of  an  aerial 
photograph  of  the  traffic,  shown  in  Figure  1.1(b);  each  dot  represents  the  front  of 
the  bonnet  of  a  vehicle.  The  lengths  of  the  vehicles  are  ignored,  and  all  vehicles  are 
represented  on  a  single  line,  although  actually  the  road  has  three  lanes,  each 
containing  some  vehicles.  The  road  was  busy,  but  the  traffic  was  flowing 
freely.  □ 


Figure  1.1(a)  Positions  of  vehicles  in  free-flowing  traffic 


Figure  1.1(b) 


Figure  1.1(a)  is  based  on  Figure  3.6 
in  M245  Unit  1.  The  original  aerial 
photograph  is  also  reproduced  there. 
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Example  1.2  Heavy  traffic 

Figure  1.2  shows,  in  a  similar  way  to  Figure  1.1(a),  the  positions  of  vehicles  in  an 
eastbound  stream  of  traffic  on  the  coast  road  from  Blackpool  to  Preston,  which 
has  only  a  single  lane  in  each  direction.  This  is  also  derived  from  an  aerial 
photograph,  taken  on  a  Sunday  evening  when  the  road  was  very  congested.  □ 


Figure  1.2  is  derived  from  the  aerial 
photograph  which  is  reproduced  in 
M245  Unit  1,  Figure  3.7. 


Figure  1.2  Positions  of  vehicles  in  heavy  traffic 


Examples  1.1  and  1.2  both  illustrate  a  pattern  of  objects  in  one-dimensional  space. 
You  can  see  that  the  patterns  are  very  different.  Most  of  the  objects  (vehicles  in 
this  case)  shown  in  Figure  1.2  are  spaced  quite  regularly;  this  road  was  essentially 
full  and  the  vehicles  were  as  close  as  possible.  On  the  other  hand,  in  Figure  1.1(a) 
there  is  much  less  regularity;  some  objects  are  closely  bunched  together,  but  there 
are  some  long  gaps. 


Example  1.3  A  forest 


Figure  1.3  shows  the  positions  of  211  trees  of  the  white  oak  species  growing  within 
a  square-shaped  area  of  19.6  acres  in  Lansing  Woods,  Michigan.  White  oaks  are 
just  one  of  several  species  of  tree  growing  in  these  woods.  You  can  see  that  the 
white  oaks  are  not  spaced  evenly  through  the  wood;  there  are  some  very  sparse 
areas  and  some  where  the  trees  are  growing  much  more  densely.  □ 


Source  of  data:  Dr  D.  J.  Gerrard; 
diagram  reproduced  from  Some 
Problems  in  the  Analysis  of  Spatial 
Pattern  by  Trevor  F.  Cox, 

Ph.D.  thesis,  University  of  Hull. 


Figure  1.3  The  positions  of  21 1  white  oak  trees  in 
Lansing  Woods,  Michigan 


Example  1.4 

A  narrow  strip  or  transect  is  marked  on  Figure  1.3.  The  positions  of  trees  in  this 
transect,  represented  as  points  along  a  straight  line,  are  shown  in  Figure  1.4.  This 
is  a  one-dimensional  pattern  of  objects  which  has  been  derived  from  the  two- 
dimensional  pattern  of  Example  1.3.  □ 


Figure  1.4  White  oak  trees  along  a  transect 
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Example  1.5  London  post  offices 

Figure  1.5  shows  the  locations  of  the  62  post  offices  in  a  rectangular  area  of 
central  London,  approximately  6  km  E/W  by  4.5  km  N/S.  You  can  see  that  there 
are  no  post  offices  (obviously)  in  the  parks,  but  otherwise  there  is  a  fairly  even 
spread.  □ 
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Figure  1.5  Post  offices  in  London 


Example  1.6  Redwood  seedlings 


The  positions  of  194  redwood  seedlings  found  on  a  square  experimental  plot  are 
shown  in  Figure  1.6.  It  was  thought  that  seedlings  would  be  scattered  fairly 
randomly  except  that  clusters  of  seedlings  would  appear  round  the  old  redwood 
tree  stumps  present  in  the  plot.  A  discontinuity  in  the  soil,  very  roughly  marked  by 
the  diagonal  line  in  Figure  1.6,  might  be  expected  to  cause  different  types  of 
pattern  in  region  I  (72  seedlings)  and  region  II  (122  seedlings).  □ 


The  original  paper  refers  to  a  total 
of  199  seedlings,  77  of  them  in 
region  I.  The  diagram  does  not 
show  as  many  as  that. 

Source:  David  J.  Strauss,  ‘A  model 
for  clustering’,  Biometrika,  vol.  62 
(1975)  p.  467. 


Figure  1.6  Redwood  seedlings  on 
an  experimental  plot 


Example  1.7  Ants’  nests 

Figure  1.7  shows  the  sites  of  the  nests  of  two  species  of  ant,  Messor  wasmanni  and 
Cataglyphis  bicolor,  in  an  area  of  approximately  90  m  x  90  m  in  northern  Greece. 
The  region  is  divided  into  two  main  parts:  the  north-west  part  is  open  scrubland 
and  the  south-east  is  a  field.  Ants  cannot  build  nests  at  some  sites  in  the  scrub 
where  bushes  are  growing.  There  is  a  track  down  the  west  side  of  the  region. 
Altogether  there  are  68  Messor  nests  and  29  Cataglyphis  nests.  □ 


Source:  R.  D.  Harkness  and  Valerie 
Isham,  ‘A  bivariate  spatial  point 
pattern  of  ants’  nests’,  Applied 
Statistics,  vol.  32  (1983)  p.  293. 


In  all  these  examples,  the  figures  show  the  positions  of  objects  observed  at  a 
particular  instant  of  time.  The  objects  are  vehicles  in  Examples  1.1  and  1.2,  trees  in 
Examples  1.3  and  1.4,  post  offices  in  Example  1.5,  seedlings  in  Example  1.6  and 
ants’  nests  in  Example  1.7.  In  the  last  example  there  are  two  kinds  of  object — nests 
of  two  species  of  ant — but  in  the  other  examples  the  objects  are  all  of  the  same 
kind. 

In  the  patterns  of  events  in  time  discussed  in  Unit  2,  each  point  in  a  pattern  was 
called  an  event.  What  name  shall  we  give  to  the  points  in  our  spatial  patterns,  e.g. 
the  211  tree  positions  in  Figure  1.3?  Some  authors  call  them  events,  as  in  the 
patterns  along  a  time  axis;  but  this  seems  rather  an  artificial  word  for  trees  or  post 
offices.  Then  why  not  call  them  points?  This  sounds  a  natural  choice,  but,  as  you 
will  see,  we  need  to  keep  the  word  ‘point’  for  a  different  purpose  (see  Section  4).  So 
in  this  unit  we  shall  call  ehch  point  in  the  pattern  an  object,  objects  being  trees, 
post  offices,  ants’  nests  or  whatever,  according  to  the  context. 

In  this  unit  we  are  concerned  not  with  the  shapes  or  sizes  of  objects  but  only  with 
their  positions.  The  objects  can  be  represented  as  points  because  the  distances 
between  them  are  generally  large  compared  with  the  sizes  of  individual  objects.  In 
more  detailed  analyses  of  spatial  patterns,  the  size  or  shape  of  objects  could  well 
be  of  importance — for  example,  the  size  of  red  blood  corpuscles  is  of  the  same 
order  of  magnitude  as  the  distance  between  them — but  we  shall  assume  that  an 
object  can  be  thought  of  as  lying  at  a  single  point. 

In  each  example,  the  points  form  a  pattern  in  space  or  spatial  pattern.  In  Examples 
1.1,  1.2  and  1.4,  the  space  is  one-dimensional  and  the  points  are  strung  out  along  a 
straight  line  (representing  a  road  in  Examples  1.1  and  1.2,  and  a  transect  of  a 
forest  in  Example  1.4).  In  the  other  examples,  the  space  is  two-dimensional,  and 
the  points  are  positioned  over  a  region. 
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Patterns  are  also  formed  by  the  positions  of  objects  in  three-dimensional  space,  e.g. 
stars,  jellyfish  in  a  particular  location  in  the  sea,  or  dust  particles  within  a  room. 
No  examples  are  given  of  these  as  they  are  hard  to  illustrate  diagrammatically, 
and  they  will  not  be  discussed  in  this  unit.  The  properties  of  spatial  patterns  in 
three  dimensions  follow  similar  principles  to  those  of  patterns  in  two  dimensions. 

From  the  point  of  view  of  probability  modelling  and  statistical  analysis,  spatial 
patterns  of  objects  along  a  line  (e.g.  Figures  1.1(a),  1.2,  1.4)  are  essentially  the  same 
as  patterns  of  events  in  time,  which  were  discussed  in  Unit  2.  For  example,  one  can 
have  a  Poisson  process  of  time  instants  at  which  successive  events  occur  (such  as 
earthquakes),  or  a  Poisson  process  of  positions  of  objects  along  a  line,  observed  at 
a  fixed  moment  in  time  (such  as  vehicles  on  an  uncongested  motorway  in  an  aerial 
photograph).  From  the  probability  point  of  view,  they  are  the  same  kind  of 
process.  So  patterns  such  as  those  in  Examples  1.1,  1.2  and  1.4  can  be  studied  by 
the  methods  of  Unit  2,  and  we  need  pursue  the  one-dimensional  case  no  further. 
Accordingly,  this  unit  will  concentrate  on  spatial  patterns  in  two  dimensions. 

What  sort  of  questions  might  we  want  to  ask  about  a  spatial  pattern?  Here  are 
some.  Can  the  objects  in  a  particular  pattern  be  taken  to  be  randomly  distributed 
over  the  area  they  occupy?  Indeed,  what  does  ‘randomly  distributed’  mean?  If  the 
pattern  is  not  random,  how  can  we  describe  it  and  what  are  its  characteristics? 
What  is  a  plausible  probability  model  for  the  underlying  process?  What  kinds  of 
pattern  are  there?  These  are  some  of  the  questions  which  will  be  discussed  in  the 
remaining  sections  of  this  unit. 

In  all  the  above  examples,  the  spatial  pattern  occurs  over  continuous  space  so  that 
an  object  can  appear  at  any  position  on  the  given  line  or  over  the  given  area. 
Analogous  to  events  in  time,  patterns  in  space  may  also  occur  in  discrete  space, 
though  in  fact  these  rarely  occur  in  practice.  One  example  is  the  pattern  that  arises 
when  trees  are  planted  in  a  regular  pattern,  and  after  a  period  of  time  some  of  the 
trees  have  died.  Most  of  the  models  discussed  in  this  unit  are  for  patterns  in 
continuous  space,  but  some  models  are  described  for  patterns  in  discrete  space. 

In  the  next  section  models  will  be  introduced  for  spatial  patterns  that  are  the  two- 
dimensional  analogues  of  the  familiar  Bernoulli  model  (discrete)  and  Poisson 
model  (continuous)  for  point  processes  in  one  dimension. 


2  Random  patterns  and  the  Poisson  process  in  two 
dimensions 


The  object  of  this  section  is  to  describe  the  concepts  of  random  patterns  in  both 
discrete  and  continuous  two-dimensional  space.  These  concepts  are  introduced  by 
means  of  simulations;  firstly,  in  Subsection  2.1,  with  the  familiar  Bernoulli  and 
Poisson  processes  in  one  dimension;  then,  in  Subsection  2.2,  the  simulation 
procedures  are  extended  to  two  dimensions.  This  leads  to  a  definition  of  a  Poisson 
process  in  two  dimensions. 


2.1  Simulating  spatial  patterns  in  one  dimension 

Example  2.1  Simulating  a  Bernoulli  process 

The  first  50  random  digits  in  row  1  of  the  table  on  page  42  of  Neave  are  given  in 
Figure  2.1(a).  There  are  six  9s  among  the  50  digits  and  these  are  shown  ringed. 
The  positions  of  the  9s  generate  a  pattern  of  objects  on  a  line,  as  shown  in 
Figure  2.1(b). 


The  relationship  of  patterns  along  a 
line  and  patterns  in  time  is  discussed 
in  M245  Unit  1,  with  particular 
reference  to  Poisson  processes;  see, 
in  particular,  pages  32,  38-9. 
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Question  2.1 

(i)  Suppose  the  remaining  50  digits  in  row  1  of  the  table  are  used  to  generate 
another  pattern  of  objects  on  a  line.  How  many  objects  will  this  pattern 
contain? 

(ii)  We  have  now  made  two  observations  on  a  random  variable  X,  which  is  the 
number  of  9s  in  50  random  digits.  What  is  the  distribution  of  XI  □ 

'The  second  realization,  obtained  from  the  last  50  digits  of  row  1  of  the  table  is 

shown  in  Figure  2.1(c). 

(a)  0  2  4  8  4  8  8  1  3®  3  178835873632  5®@®  8  8620644418534  1  ®  1  5  0  2®4 


(b) 


10 


15 


20 


(c)  _ _ _ _  .  i  -  -r  ,  ,  , 

5  10  15  20 

Figure  2.1  Realizations  of  a  Bernoulli  process 


'  30  ' 
30 


35  '  '  '  40  ' 

'  35  '  '  40 


The  objects  in  this  process,  i.e.  the  different  occurrences  of  the  digit  9,  can  be 
located  only  at  the  discrete  points  1,2,  ...,  50  along  the  line,  so  Figure  2.1  shows 
two  realizations  of  a  Bernoulli  process  in  one-dimensional  space  with  parameter 
P  =  0.1.  □ 


45 


45 


As  p  was  chosen  to  be  quite  small,  the  Bernoulli  process  is  fairly  close  to  a  Poisson 
process.  However,  the  positions  of  objects  in  a  Poisson  process  in  one-dimensional 
space  can  be  easily  simulated.  Probably  the  simplest  way  to  do  this,  as  described 
in  Unit 2*  is  to  simulate  successive  intervals  using  random  numbers  from  an 
exponential  distribution.  However,  for  reasons  which  will  soon  become  clear,  a 
different  method,  based  on  the  uniform  distribution,  is  now  introduced. 

Suppose  we  know  that  along  some  interval,  AB,  of  a  line,  just  one  object  is  , _ 

observed  from  a  Poisson  process;  see  Figure  2.2.  Now,  by  the  first  postulate  of  the  A 
Poisson  process,  the  probability  that  an  object  (event)  occurs  in  any  small  interval  Figure  2.2 
of  fixed  length  is  the  same,  wherever  that  interval  lies.  So  it  follows  that  the  object 
is  equally  likely  to  be  at  any  point  P  on  the  line.  In  probability  terms,  the  ratio 
AP/AB  has  a  uniform  distribution  (7(0, 1).  Now  suppose  that  instead  of  just  one 
object  we  know  that  k  objects  are  observed  from  a  Poisson  process  on  the  line  AB. 

By  the  third  postulate,  objects  occur  independently  of  each  other.  So  if  the 

positions  of  the  k  objects  are  Pu  P2,  ...,  Pk,  then  each  of  the  ratios 

APJAB,  ...,  APJAB  has  a  uniform  distribution  17(0, 1)  and  these  distributions  are 

independent. 

Example  2.2  Simulating  a  Poisson  process 

In  order  to  be  comparable  with  the  simulation  of  the  Bernoulli  process,  we  shall 
generate,  on  a  line  50  units  in  length,  a  realization  of  a  Poisson  process  with 
density  2  =  0.1  objects  per  unit  length.  When  events  occur  in  time,  2  is  defined  as 
the  rate  of  occurrence  of  events  per  unit  time;  now  that  a  spatial  pattern  of  objects 
is  being  described,  the  word  density  is  more  appropriate.  It  is  also  used  for  two- 
dimensional  patterns,  where  2  is  the  density  of  objects  per  unit  area. 

The  simulation  is  carried  out  in  two  stages.  The  first  stage  is  to  simulate  x,  the 
number  of  objects  in  the  length  of  50  units.  This  is  an  observed  value  of  a 
random  variable  X  whose  distribution  is  Poisson  with  parameter 
2  x  length  =  0.1  x  50  =  5.  This  simulation  is  done  by  the  method  described  in 
Unit  1,  Subsection  6.2,  which  is  that  if  u  is  an  observed  value  from  1/(0, 1),  then  the 
simulated  value  x  for  the  integer-valued  random  variable  X  satisfies 
F(x  —  1)  <  u  <  F(x), 

where  F(x)  is  the  c.d.f.  of  X.  Using  the  table  of  probabilities  of  the  Poisson 
distribution  on  page  14  of  Neave, 

if  0  <  u  <  0.0067,  then  x  =  0, 
if  0.0067  <  u  <  0.0404,  then  x  =  1,  etc. 

A  value  of  u  is  obtained  simply  by  reading  random  digits  from  page  42  of  Neave. 


Starting  (for  instance)  at  the  top  left-hand  corner  and  reading  down  the  first 
column  gives  0,8, 3,0,  so  we  shall  take  u  =  0.0830.  Since  0.0404  <  0.0830  <  0.1246, 
this  gives  x  =  2.  This  completes  the  first  stage  of  the  simulation. 

The  second  stage  therefore  involves  simulating  the  positions  of  two  points  on  the 
straight  line  AB  of  length  50  units.  As  you  saw  above,  each  of  the  ratios  APJAB 
and  AP2/AB  has  a  uniform  distribution,  U( 0, 1).  Reading  down  the  second  column 
of  random  digits,  we  obtain 

APJAB  =  0.237,  AP2/AB  =  0.426, 

giving  APX  =  11.85,  AP2  =  21.3.  The  simulated  realization  is  illustrated  in 
Figure  2.3.  □ 

A  P2 

[ - !—• - 1  • - 1 - 1 - — - 1 

A  10  20  30  40  B 

Figure  2.3  A  simulated  realization  of  a  Poisson  process  with  density  0.1 

Question  2.2  Simulate  a  second  realization  of  a  Poisson  process  in  one¬ 
dimensional  space  with  density  0.1  objects  per  unit  length  over  a  length  of  50 
units,  using  the  same  method  as  in  Example  2.2.  Use  the  second  block  of  the  first 
column  of  the  table  of  random  digits  (9,4,3,  ...)  to  simulate  x,  and  continue  down 
the  second  column  (2, 1,7,  ...)  to  simulate  the  ratios.  Draw  a  rough  diagram  of  the 
simulation.  □ 


That  is,  there  are  two  objects  in  the 
simulation. 


Since  all  we  want  to  do  is  to  show  a 
diagram  of  a  simulation,  there  is  no 
need  to  take  the  values  of  the  ratios 
to  more  than  three  places  of 
decimals. 


In  this  subsection  we  have  looked  at  random  patterns  of  objects  along  a  line  in 
both  discrete  and  continuous  space.  These  concepts  will  now  be  extended  to  two- 
dimensional  space. 


2.2  Random  patterns  in  two  dimensions 


Example  2.3  Simulating  a  Bernoulli  process  in  two  dimensions 


In  Example  2.1,  we  used  row  1  of  the  table  on  page  42  of  Neave  to  simulate  a  one¬ 
dimensional  process.  Figure  2.4  shows  a  20  x  20  square  array  of  the  first  20  digits 
in  the  first  20  rows  of  the  same  table.  There  are  41  9s  among  the  400  digits 
(surprisingly  close  to  the  40  expected),  and  these  are  shown  ringed.  In  Figure  2.5 
their  positions  are  shown  on  the  vertices  of  a  20  x  20  square  lattice.  The  word 
‘square’  refers  to  the  shape  of  the  individual  cells,  not  to  the  fact  that  the  pattern 
in  Figure  2.5  happens  to  cover  a  square  area  of  20  x  20  units.  □ 
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8  3  6  8  0  5  6  1  3  1  1  2  2  3  8  6  8  2®  1 
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0406046030237  5  16188  0 
6204  0  01  81246847  7®  3  5  2 
(9)6  4  1  7  63  3  3  6  8  8  4®  1  7  325® 
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4  8  2  5  5  2  0  8  1  5  5  1  3  2  2  0  4®  3  6 
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6 ® 1  0  1  2  1  1®2002  5  68  1645 

Figure  2.4  Simulation  of  a  Bernoulli 
process 


Figure  2.5  A  random  pattern  in  discrete 
space 


The  positions  marked  in  Figure  2.5  demonstrate  a  simulation  of  a  two-dimensional 
Bernoulli  process.  In  such  a  process,  at  each  point  (x,  y)  of  the  square  lattice  there 
can  be  either  an  object  (with  probability  p)  or  no  object  (with  probability 
<3  =  1  -  P);  occurrences  of  objects  at  different  lattice  points  are  independent.  In  the 
simulation  above,  p  is  equal  to  0.1.  Obviously,  Bernoulli  processes  with  different 
values  of  p  could  be  simulated  similarly,  if  necessary  using  more  than  one  digit  for 
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each  possible  position.  The  process  could  be  defined  formally  as 

{^(x.y);  x,y  =  1,2,  20},  where  X(x,y)  ~  B(l,p).  However,  formal  definitions 

will  not  be  used  in  this  unit. 

The  two-dimensional  Bernoulli  process  can  be  regarded  as  a  random  pattern  in 
discrete  space.  Just  as  the  processes  are  analogous  in  one  dimension,  the 
continuous  analogue  of  the  two-dimensional  Bernoulli  process  is  the 
two-dimensional  Poisson  process.  This  is  defined  in  essentially  the  same  way  as  an 
ordinary  one-dimensional  Poisson  process;  there  are  the  same  three  postulates,  the 
only  change  being  that  they  are  expressed  in  terms  of  elementary  areas  da  instead 
of  elementary  time  (or  length)  intervals  [t,t  +  <5t].  For  the  two-dimensional 
Poisson  process,  they  are  as  follows. 


I  The  probability  that  there  is  (exactly)  one  object  in  any  small  region  of 
area  Sa  is  equal  to  Ada  +  o{da). 

II  The  probability  that  there  are  two  or  more  objects  in  the  small  region 
of  area  da  is  equal  to  o(da). 

III  The  presence  of  an  object  is  any  region  Ax  and  the  presence  of  an  object 
in  any  non-overlapping  region  A2  are  independent  events. 


The  parameter  A  is  called  the  density  of  the  Poisson  process,  and  signifies  the  mean 
number  of  objects  per  unit  area  (e.g.  trees  per  acre,  post  offices  per  square  mile). 

An  important  difference  between  a  one-dimensional  and  a  two-dimensional 
Poisson  process  is  that  in  the  latter  there  is  no  obvious  ordering  of  objects  (or 
events).  In  a  one-dimensional  process  in  time,  events  occur  in  a  definite  order  and 
we  can  talk  of  the  time  between  successive  events  or  of  the  time  until  six  events 
have  occurred.  With  a  spatial  pattern,  all  the  objects  are  assumed  to  be  in  position 
at  some  moment  in  time.  We  shall  return  to  this  difference  later  in  the  unit. 

For  the  one-dimensional  Poisson  process  in  time,  the  number  of  events  occurring 
in  any  time  interval  of  duration  t  has  a  Poisson  distribution  with  mean  It.  A 
corresponding  property  also  holds  for  the  Poisson  process  in  two  dimensions. 


For  a  Poisson  process  in  two-dimensional  space  with  mean 
density  A  per  unit  area,  the  number  of  objects  in  any  region 
of  area  A  has  a  Poisson  distribution  with  mean  A  A. 


This  result  is  derived  in  exactly  the 
same  way  as  that  for  the  one¬ 
dimensional  process. 


Question  2.3  Primroses  grow  in  a  clearing  in  a  wood  with  a  density  of  1.2  plants 
per  square  metre. 

(i)  What  is  the  probability  that  there  are  exactly  three  primroses  in  a  square  of 
area  one  square  metre? 

(ii)  What  is  the  probability  that  there  are  exactly  three  primroses  in  a  circle  of 
area  one  square  metre? 

(iii)  What  is  the  probability  that  there  are  no  primroses  growing  in  a  particular 
area  of  one  square  metre? 

(iv)  What  is  the  probability  that  at  least  five  primroses  are  found  in  an  area  of  two 
square  metres?  □ 

We  shall  now  simulate  a  realization  of  a  two-dimensional  Poisson  process,  using 
the  same  technique  as  was  used  in  Subsection  2.1  to  simulate  a  one-dimensional 
Poisson  process. 

Example  2.4  Simulating  a  Poisson  process  in  two  dimensions 

We  shall  produce  a  simulation  of  a  two-dimensional  Poisson  process  with  density 
A  =  0.1  over  an  area  of  observation  which  is  a  square  of  20  units  by  20  units.  This 
will  be  a  continuous  analogue  of  the  pattern  in  Figure  2.5. 

The  simulation  will  be  carried  out  in  two  stages  in  a  similar  manner  to  that  used 
in  Example  2.2  for  a  one-dimensional  Poisson  process.  Firstly,  we  shall  simulate 
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the  number,  x,  of  objects  in  the  square;  secondly,  we  shall  simulate  the  position  of 
each  object  in  the  square. 

The  number  of  objects,  X,  has  a  Poisson  distribution  with  parameter 
IA  =  0.1  x  202  =  40.  A  value  of  X  is  simulated  by  the  standard  procedure,  using  a 
simulated  value,  u,  from  U( 0, 1)  and  choosing  x  to  satisfy  F(x  —  1)  <  u  <  F(x), 
where  F(x)  is  the  c.d.f.  of  X.  This  time,  we  shall  use  the  bottom  line  of  the  table  of 
random  digits  in  Neave,  page  42,  which  gives  u  =  0.1960.  From  the  table  of  the 
Poisson  distribution  in  Neave,  page  16,  with  n  =  40,  we  have  F(34)  =  0.1938  and 
F(35)  =  0.2423,  so  this  gives  a  simulated  value  of  x  =  35. 


The  second  stage  is  to  simulate  35  positions  Plt  P2, ...,  P35,  generated  by  a  two- 
dimensional  Poisson  process  and  falling  within  a  20  x  20  square  area.  The 
position  P  of  any  object  is  equally  likely  to  be  at  any  point  in  the  square  ABCD 
(see  Figure  2.6),  so  AM  and  AN  are  independent  random  values  from  the  uniform 
distributions  on  lengths  AB  and  AD  respectively:  in  other  words,  AM/AB  and 
AN /AD  are  independently  distributed  as  U{  0, 1). 

Suppose  we  obtain  the  random  values  of  u  by  reading  down  column  3  of  the  table 
on  page  42  of  Neave.  So  for  the  first  position,  Px, 

AM  JAB  =  0.463  and  ANJAD  =  0.004. 

To  obtain  the  coordinates  of  Pu  multiply  by  20:  they  are  (9.26,  0.08).  The  next 
position,  P2,  is  independent  of  P{  and  is  simulated  in  exactly  the  same  way: 
AM2/AB  =  0.230,  ANJAD  =  0.567.  This  procedure  is  continued  until  the  positions 
of  all  35  objects  have  been  simulated.  These  values  are  shown  in  Table  2.1  and  the 
positions  are  plotted  in  Figure  2.7. 


The  value  of  the  parameter  X  is 
immaterial  from  now  on:  we  know 
how  many  objects  there  are.  All  that 
remains  is  to  find  where  they  are 
located. 


Table  2.1 

Positions  of  35  objects 

Object,  Pf 

AM  JAB 

ANJAD 

Coordinates  in 

A 

1  M 

B 

20  x  20  square 

Figure  2.6 

P\ 

0.463 

0.004 

(9.26,0.08) 

Pi 

0.230 

0.567 

(4.60, 11.34) 

p3 

0.835 

0.029 

(16.70,0.58) 

p* 

0.310 

0.063 

(6.20,1.26) 

P5 

0.562 

0.453 

(11.24,9.06) 

Pc, 

0.943 

0.643 

(18.86,12.86) 

Pi 

0.763 

0.316 

(15.26,6.32) 

Ps 

0.298 

0.979 

(5.96, 19.58) 

P9 

0.790 

0.946 

(15.80,18.92) 

Pi  0 

0.315 

0.906 

(6.30,18.12) 

20 

P11 

0.883 

0.641 

(17.66,12.82) 

• 

• 

Pi  2 

0.979 

0.229 

(19.58,4.58) 

• 

Pit 

0.648 

0.255 

(12.96, 5.10) 

• 

Pi 4 

0.008 

0.069 

(0.16,1.38) 

Pi  5 

0.677 

0.989 

(13.54,19.78) 

• 

• 

Pie 

0.994 

0.780 

(19.88,15.60) 

15- 

• 

• 

Pi  7 

0.712 

0.175 

(14.24,3.50) 

• 

Pis 

0.293 

0.535 

(5.86, 10.70) 

Pl9 

0.843 

0.152 

(16.86,3.04) 

•  • 

P20 

0.009 

0.270 

(0.18,5.40) 

• 

Pll 

0.406 

0.007 

(8.12,0.14) 

• 

Pl2 

0.388 

0.718 

(7.76, 14.36) 

10- 

P23 

0.462 

0.356 

(9.24, 7.12) 

• 

Pi  4 

0.014 

0.028 

(0.28,0.56) 

• 

• 

P25 

0.038 

0.026 

(0.76,0.52) 

P26 

0.829 

0.227 

(16.58,4.54) 

• 

• 

Pll 

0.121 

0.865 

(2.42, 17.30) 

• 

P 28 

0.095 

0.319 

(1.90,6.38) 

5- 

• 

Pi  9 

0.498 

0.613 

(9.96, 12.26) 

P30 

0.543 

0.612 

(10.86, 12.24) 

• 

• 

P31 

0.856 

0.406 

(17.12,8.12) 

P32 

0.233 

0.759 

(4.66,15.18) 

•  • 

P33 

0.624 

0.820 

(12.48, 16.40) 

•  • 

- , _ *_ 

_  • 

• 

P34 

0.020 

0.421 

(0.40, 8.42) 

5 

10  15 

- » 

P35 

0.063 

0.806 

(1.26,16.12) 

L\) 

Figure  2.7  A  simulation  of  a  two-dimensional  Poisson 

process 

This  figure  represents  the  required  simulation  of  a  Poisson  process  with  density  0.1 
over  a  square  region  of  area  20  units  x  20  units.  This  is  also  referred  to  as  a 

random  pattern  or  a  random  spatial  pattern.  □ 
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Example  2.5  A  random  pattern  on  a  rectangle 

The  same  method  as  was  used  in  Example  2.4  can  be  used  to  generate  a 
simulation  of  a  random  pattern  on  a  rectangle  of  size  a  x  b.  The  number  of  objects 
is  determined  by  simulating  an  observation  from  a  Poisson  distribution  with 
parameter  Xab.  The  positions  are  simulated  by  taking  random  observations  from 
U( 0, 1)  in  pairs,  multiplying  one  by  a  and  the  other  by  b.  □ 

Question  2.4  Explain  how  you  would  simulate  a  random  pattern  with  a  given 
density  over  an  irregular  area  such  as  a  map  of  the  United  Kingdom.  □ 

Transects  across  random  patterns 

Figure  1.3  showed  a  spatial  pattern  of  white  oak  trees  in  a  wood.  A  transect  was 
marked  across  the  region  and  a  one-dimensional  pattern  was  obtained  from  those 
objects  which  fell  in  the  transect.  This  was  illustrated  in  Figure  1.4.  Clearly,  the 
properties  of  a  one-dimensional  pattern  along  a  transect  are  related  to  the  two- 
dimensional  pattern  on  which  the  transect  is  drawn.  If  the  spatial  pattern  is 
random  (that  is,  if  it  is  a  two-dimensional  Poisson  process),  then  the  pattern 
obtained  from  the  transect  is  also  random  (a  one-dimensional  Poisson  process),  as 
will  now  be  shown. 

Suppose,  in  fact,  that  we  have  a  two-dimensional  Poisson  process  with  density  X 
on  some  region,  and  that  a  transect  of  width  h  is  marked  on  it  in  any  direction 
(see  Figure  2.8). 


Figure  2.8 

The  value  of  h  need  not  necessarily  be  small,  though  in  practical  situations  it 
usually  is.  Some  of  the  objects  in  the  spatial  Poisson  process  will  fall  within  the 
transect;  call  these  Px,  P2,  ...  .  For  each  such  object  Ph  the  projected  point  Qt  can 
be  marked  on  a  fixed  line  AB  parallel  to  the  transect  (any  such  line  will  serve  the 
purpose).  The  points  <2,  form  a  one-dimensional  spatial  process  and  will  be 
measured  from  some  convenient  origin  0  on  the  line  AB,  so  that  OQt  =  x;.  The 
points  Qi  satisfy  the  three  postulates  of  a  one-dimensional  Poisson  process  in 
space,  as  follows. 

(I)  Let  the  points  on  AB  at  distances  x,  x  +  Sx  from  O  be  M,  N  respectively  and 
let  MX,NX,M2,N2  be  the  corresponding  points  on  the  boundaries  of  the 
transect,  as  shown  in  Figure  2.8.  Then 

P(one  point  Qt  occurs  in  [x,  x  +  5x]) 

=  P(one  object  Pt  occurs  in  the  rectangle  M[N1N2M2 ) 

=  Xh  dx  +  o{h  <5x) 

=  Xh  Sx  +  o((5x),  0(/j  <5X)  is  same  as  0(<5x). 

since  the  area  MlN1N2M2  =  hdx,  and  Postulate  I  of  the  two-dimensional 
Poisson  process  applies. 
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(II)  The  probability  that  two  or  more  points  Qt  occur  in  [x,x  +  <5x]  is  equal  to 
the  probability  that  there  are  two  or  more  objects  in  the  rectangle 
MlN1N2M2.  By  Postulate  II  of  the  two-dimensional  process,  this  probability 
is  o(h  Sx)  or  o(r5x). 

(III)  The  occurrence  of  projected  points  Q.x  after  any  point  x  on  the  line  AB  is 
independent  of  the  occurrence  of  projected  points  before  the  point  x,  because 
the  positions  of  objects  in  non-overlapping  regions  of  the  two-dimensional 
process  are  independent. 

So  all  the  postulates  for  a  one-dimensional  Poisson  process  are  satisfied  by  the 
point  process  on  AB.  This  can  be  summarized  as  follows. 


The  pattern  of  projected  points  in  any  transect  of  width  h  in 
a  random  spatial  pattern  with  density  X  is  a  one-dimensional 
Poisson  process  with  density  Xh. 


Note  the  dimensions  of  the  parameters  of  the  two  processes:  X  per  unit  area  and 
Xh  per  unit  length.  The  factor  h  reflects  the  change  in  dimensions  from  area  to 
length. 

In  this  section  a  random  spatial  pattern  in  continuous  space  has  been  defined  as 
one  that  can  be  modelled  by  a  two-dimensional  Poisson  process.  From  Figure  2.7 
you  can  see  that  the  objects  are  by  no  means  regularly  spaced;  for  example,  there 
are  three  objects  clustered  together  in  the  bottom  left-hand  corner,  whereas  a  little 
way  up  and  to  the  right  of  that  corner  there  is  a  region  containing  no  objects. 

There  are  several  ways  of  testing  statistically  whether  or  not  a  particular  pattern 
can  be  regarded  as  a  random  pattern.  It  can  be  shown  that  the  pattern  of  redwood 
seedlings  in  region  I  of  Figure  1.6  can  be  regarded  as  random,  but  the  pattern  in 
region  II  is  not  random.  You  can  see  that  those  seedlings  show  considerable 
clustering.  The  pattern  of  Cataglyphis  nests  in  Figure  1.7  can  also  be  modelled  by 
a  random  spatial  process,  but  the  positions  of  the  Messor  nests  are  too  regular  to 
be  consistent  with  the  Poisson  process  assumptions. 

Patterns  with  regularity  and  patterns  with  clustering  are  the  two  main  types  of 
spatial  pattern  that  are  not  random.  In  the  next  section  we  shall  develop  some 
models  for  these  types  of  pattern. 


3  Non-random  spatial  patterns 

3.1  Patterns  with  regularity 

There  are  many  spatial  patterns  in  which  the  objects  are  more  uniformly 
positioned  than  in  a  random  pattern,  and  these  are  known  as  patterns  with 
regularity.  They  arise  in  a  variety  of  situations  which  lead  to  different  kinds  of 
regularity,  and  hence  different  models  are  required.  In  Figure  1.7,  the  pattern  of 
Messor  ants’  nests  shows  too  much  regularity  to  be  regarded  as  random;  the  likely 
reason  for  this  is  that  each  nest  has  its  own  foraging  territory,  so  that  there  is  a 
tendency  for  the  inhabited  area  to  be  shared  out  between  the  nests.  The  London 
post  offices  shown  in  Figure  1.5  also  appear  to  have  a  fairly  regular  pattern.  This 
is  due  to  human  planning.  If  post  offices  were  sited  according  to  a  two- 
dimensional  Poisson  process,  then  sometimes  two  post  offices  would  be  positioned 
very  close  to  each  other  and  this  would  obviously  not  be  sensible.  The  ants’  nests 
and  the  post  offices  both  occur  in  patterns  with  regularity  for  similar  reasons. 
There  are  also  other  types  of  situation  which  give  rise  to  patterns  with  regularity, 
and  these  will  be  discussed  later  in  the  subsection  as  various  models  are 
introduced. 


The  extreme  case  is  that  of  complete  regularity,  when  the  objects  are  laid  out 
exactly  in  a  uniform  pattern.  In  one  dimension,  this  simply  means  that  the  objects 
are  uniformly  spaced;  for  example,  telephones  and  street  lamps  are  both  regularly 
spaced  along  a  motorway,  and  the  ten  hurdles  in  a  110-metre  race  are  positioned 
with  exactly  9.14  metres  between  consecutive  hurdles.  In  two  dimensions,  the 
objects  occur  at  the  vertices  of  a  lattice  which  is  built  up  of  regular  polygons.  The 
most  obvious  form  of  lattice  is  the  square  lattice  (see  Figure  3.1(a)),  which  can  be 
thought  of  as  being  constructed  by  fitting  together  equal-sized  squares.  There  are 
only  two  other  possible  regular  lattice  patterns — the  triangular  lattice  (Figure 
3.1(b)),  which  is  made  up  of  equilateral  triangles,  and  the  hexagonal  lattice  (Figure 
3.1(c)),  which  is  composed  of  regular  hexagons. 


Figure  3.1  The  three  regular  lattice  patterns 


Trees  planted  in  an  orchard  and  plants  in  a  nursery  provide  familiar  examples  of 
completely  regular  patterns;  square  and  triangular  lattices  arc  the  most  common. 
Figure  3.2  shows  a  photograph  of  cloned  oil  palm  trees  which  are  planted  in  a 
completely  regular  triangular  lattice  pattern. 


Figure  3.2  Cloned  oil  palm  trees 
4wf V  }v\j 

In  Section  5-of  jMk-2  you  studied  the  pattern  in  time  created  by  regular  arrivals 
with  perturbations,  which  is  a  model  for  events  which  are  scheduled  to  occur 
regularly  but  where  the  time  of  each  event  is  subject  to  random  error;  for  example, 
trains  do  not  arrive  exactly  when  scheduled.  Street  lamps  along  a  town  road, 
which  sometimes  are  not  regularly  spaced  because  of  side  roads  or  other 
obstructions,  would  have  a  similar  one-dimensional  spatial  pattern.  A  similar 
model  can  be  used  for  a  two-dimensional  spatial  pattern.  Suppose,  for  example, 
that  bushes  are  meant  to  be  planted  according  to  a  plan  which  positions  them  at 
the  vertices  Vx,  V2,  ...  of  (say)  a  square  lattice.  In  practice  they  may  not  be 
positioned  exactly  at  Fj,F2,  ...,  but  instead  at  positions  PX,P2,  ...  slightly 
displaced  from  Vu  V2,  ...  (see  Figure  3.3).  The  pattern  of  bush  positions  will  not  be 
completely  regular,  but  it  will  be  much  more  regular  than  a  random  pattern.  This 
is  an  example  of  a  pattern  possessing  regularity,  but  not  complete  regularity. 


Source:  Dr  L.  Jones,  Unilever 
Research. 
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Figure  3.3  A  lattice  pattern  with  independent 
random  displacements 

To  model  such  a  pattern,  it  is  necessary  to  model  the  displacements 

^2^2 1  •••  ,  as  regards  both  their  magnitudes  and  their  directions.  A  natural 
assumption  might  be  that  the  directions  are  independent  and  uniformly  distributed 
from  0  to  2n,  and  that  the  magnitudes  (i.e.  the  lengths  Fj Pu  V2P2,  ...)  are 
independent  of  each  other  and  of  the  directions,  and  have  some  common 
distribution  (e.g.  exponential).  This  type  of  pattern  is  called  a  lattice  pattern  with 
independent  random  displacements. 

Another  regular  pattern  which  can  arise  in  the  orchard  or  plantation  context  is 
when  the  trees  were  indeed  planted  in  a  lattice  formation,  but  some  of  them  have 
subsequently  died.  One  modelling  assumption  might  be  that  death  occurs 
independently  from  tree  to  tree  among  those  originally  planted,  with  a  small 
probability  p  of  occurrence,  which  is  the  same  for  each  original  tree.  We  would 
then  have  a  lattice  pattern  with  random  deaths. 

Question  3.1  If  the  original  pattern  of  trees  is  a  square  lattice,  and  the  deaths 
occur  randomly  as  just  described,  what  is  the  spatial  pattern  of  the  surviving 
trees?  □ 

Question  3.2  Trees  are  planted  in  a  lattice  formation  in  an  orchard.  Some  years 
later,  5%  of  them  have  died.  Why  might  the  above  model  of  random  deaths  be 
unsatisfactory  as  a  representation  of  what  has  occurred?  □ 

The  models  that  have  been  developed  so  far  for  patterns  with  regularity  are 
modifications  of  completely  regular  lattice  patterns.  Models  for  patterns  with 
regularity  can  also  be  developed  in  other  ways.  One  of  the  most  important  causes 
of  regularity  is  inhibition  between  objects  which  are  near  to  one  another:  it  is 
sometimes  physically  impossible  for  two  objects  in  close  proximity  to  coexist.  For 
example,  two  trees  cannot  grow  very  close  to  each  other.  It  is  this  property  of 
inhibition  that  provides  the  basis  of  the  other  model  for  patterns  with  regularity 
that  will  now  be  developed. 

In  Subsection  1.1  of  Unit  2,  a  model  for  inhibition  of  events  occurring  in  time  was 
described  briefly.  When  a  Geiger  counter  is  recording  radioactive  particles,  there  is 
a  short  ‘dead’  period  after  a  particle  is  recorded,  during  which  any  further  particles 
arriving  are  ignored.  It  is  assumed  that  particles  arrive  according  to  a  Poisson 
process,  but  no  particle  is  recorded  that  arrives  within  a  time  t,  say,  of  a 
previously  recorded  particle.  So  the  eventual  pattern  of  events  is  one  from  which 
gaps  of  less  than  t  are  omitted.  The  simplest  two-dimensional  model  of  a  spatial 
pattern  with  inhibition  is  a  direct  extension  of  this  model;  it  is  known  as  a  simple 
sequential  inhibition  process. 

Suppose,  for  example,  that  plants  grow  in  chronological  succession  from  seeds 
randomly  scattered  by  wind  or  other  agencies.  If  all  the  seeds  were  to  produce 
viable  plants,  these  would  be  positioned  over  the  area  in  a  random  pattern.  In  a 
two-dimensional  Poisson  process,  objects  can  be  nearer  together  than  any  given 
distance,  however  small.  But  a  new  plant  may  well  be  unable  to  survive  if  too  near 
to  plants  already  growing.  When  this  is  so,  the  loss  of  plants  which  are  inhibitively 
near  to  other  plants  produces  an  element  of  regularity  in  the  spatial  pattern.  This 
is  the  model  for  the  simple  sequential  inhibition  process. 

The  reason  for  the  name  is  now  apparent.  It  is  sequential  because  objects  are 
added  one  at  a  time  to  the  pattern,  and  it  has  inhibition  because  an  object  is 
prevented  or  inhibited  from  belonging  to  the  pattern  if  there  is  another  object  too 
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near  to  it.  (The  reason  for  the  word  simple  will  become  clear  at  the  end  of  this 
subsection,  when  other  sequential  inhibition  processes  are  described  briefly.)  A 
simple  sequential  inhibition  process  is  simulated  in  the  following  example. 

Example  3.1  A  simple  sequential  inhibition  process 

A  random  pattern  of  35  points  P\ »  •  •  •  >  P35  was  simulated  in  a  20  units  by  20  units 
square  region  in  Example  2.4.  The  35  objects  were  plotted  in  Figure  2.7,  and 
Table  2.1  showed  their  positions  (for  example,  the  coordinates  of  Pi  are 
(9.26,0.08)).  Suppose  now  the  condition  is  imposed  that  no  object  may  lie  nearer 
than  3  units  to  an  object  previously  simulated.  The  points  are  then  plotted  one  at 
a  time  in  the  order  given  in  Table  2.1,  rejecting  any  point  P{  which  is  nearer  than  3 
units  to  a  point  previously  plotted.  Points  Pl5  ...,  P9  are  all  accepted,  but  P10  is 
too  close  to  P8  and  so  is  rejected: 

P8Pi0  =  VC(5.96  -  6.30)2  +  (19.58  -  18.12)2] 

=  1.50 
<  3. 

Proceeding  in  this  manner,  18  points  remain  in  the  pattern:  Pl5  P2,  P3,  P4,  P5,  P6, 
Pi,  P 8 >  P( )•>  Pi2,  Pxa,  P201  P22,  P21 ,  P295  F32,  F33,  ^34»  shown  in  Figure  3.4.  Note 
that  once  a  point  has  been  rejected,  it  takes  no  further  part  in  the  simulation.  This 
is  a  sensible  procedure,  for  if  a  plant  does  not  survive,  a  new  plant  will  not  be 
inhibited  from  growing  near  the  position  originally  occupied  by  the  dead  plant.  In 
the  example  from  Table  2.1,  P28P2A  =  2.53  but  P2 8  was  rejected  because  it  was  too 
close  to  P20.  The  distance  P20P34  =  3.03  so  P34  is  included.  □ 
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Figure  3.4  A  realization  of  a  simple  sequential  inhibition  process 

Question  3.3  The  proportions  for  Example  2.4,  shown  in  Table  2.1,  were 
simulated  by  reading  digits  down  the  columns  of  the  table  on  page  42  of  Neave. 
The  place  for  the  simulation  of  ^35  was  half-way  down  column  6:  0,6, 3, 8,0,6. 
Carry  on  from  this  point  in  the  table  to  simulate  one  further  point  that  could  be 
plotted  in  Figure  3.4.  □ 

Suppose  that  the  minimum  distance  permitted  between  objects  in  a  simple 
sequential  inhibition  process  is  d.  Then  an  alternative  definition  of  the  process  can 
be  given  in  terms  of  discs  of  diameter  d.  The  discs  are  placed  one  at  a  time  in  the 
plane  with  their  centres  at  the  positions  of  a  two-dimensional  Poisson  process;  if  at 
any  stage  a  disc  would  overlap  a  disc  already  in  position,  it  is  rejected.  This 
procedure  is  equivalent  to  the  pattern-generating  mechanism  of  Example  3.1  with  d 
as  the  minimum  separation.  Figure  3.5  shows  the  pattern  of  Figure  3.4  with  circles 
of  diameter  3  drawn  round  each  object.  You  can  see  that  the  space  is  getting  ‘full’; 
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not  many  more  circles  could  be  drawn  in  Figure  3.5.  Once  the  density  becomes 
high  enough,  the  pattern  will  appear  fairly  regular. 


Figure  3.5 


The  word  ‘simple’  in  the  title  of  the  simple  sequential  inhibition  process  reflects  the 
all-or-nothing  nature  of  the  minimum  separation  condition  as  it  is  applied  in  this 
process.  Placing  two  objects  at  a  separation  of  less  than  d  is  impossible;  but 
placing  at  a  separation  greater  than  d,  even  if  not  much  greater  than  d,  is 
unrestricted.  In  many  practical  situations,  however,  inhibition  operates  in  a  less 
cut-and-dried  way.  If  a  bush  is  planted  at  distance  r  units  from  an  existing  bush, 
the  new  bush  might  be  certain  to  die  if  r  is  too  small  (say  r  <  rx)  and  certain  to ’ 
survive  if  r  is  sufficiently  large  (say  r  >  r2);  for  intermediate  values  of  the 
separation  (rx  <r  <  r2),  survival  might  be  a  matter  of  chance.  This  spatial  process 
is  known  as  a  sequential  inhibition  process.  It  could  be  modelled  by  specifying  as  a 
function  of  r  the  probability  p(r)  that  the  new  bush  survives:  p(r)  =  0  for  r  <  rlt 
p(r)  increases  from  0  to  1  in  some  specified  way  as  r  increases  from  rx  to  r2,  and 
p(r)  =  1  for  r  >  r2.  A  typical  form  of  the  function  p(r)  is  shown  in  Figure  3.6(a).  In 
the  simple  sequential  inhibition  process,  rx=r2  =  d  and  p(r)  is  a  step  function,  as 
shown  in  Figure  3.6(b). 


Figure  3.6 


In  a  sequential  inhibition  process,  the  pattern  is  built  up  object  by  object, 
sequentially.  This  is  actually  a  random  process  in  both  space  and  time.  The  spatial 
pattern  changes  with  time.  However,  most  of  the  spatial  patterns  in  this  unit  are 
assumed  to  be  fixed  throughout  time,  and  time  is  not  an  essential  feature  of  a 
process  with  inhibition.  Suppose,  for  example,  that  seeds  are  randomly  scattered  by 
the  wind  in  autumn  and  germinate  the  following  spring.  If  two  of  the  seeds  fall 
close  together,  only  one  will  be  able  to  develop  into  an  adult  plant,  and  the  other 


will  die.  To  model  this  pattern,  if  two  objects  in  a  two-dimensional  Poisson 
process  are  less  than  some  distance  d  apart,  one  of  the  objects  is  eliminated 
(selected  by  a  random  procedure).  The  model  needs  to  be  specified  carefully  when 
there  are  three  or  more  objects  close  together,  but  this  aspect  will  not  be  discussed 
in  this  course. 

In  this  subsection,  some  probability  models  have  been  developed  for  patterns  with 
regularity  where,  for  one  reason  or  another,  objects  cannot  lie  very  close  together 
and  so  they  have  a  tendency  to  be  fairly  regularly  spaced.  In  the  following 
subsection,  models  will  be  developed  for  patterns  which  show  the  opposite  kind  of 
departure  from  random  patterns. 


3.2  Patterns  with  clustering 

In  this  subsection,  models  will  be  developed  for  spatial  patterns  in  which  objects 
are  more  likely  to  occur  closer  together  than  they  would  be  in  a  random  pattern, 
so  that  there  is  a  tendency  for  them  to  appear  in  groups  or  clusters.  Such  patterns 
are  known  as  patterns  with  clustering.  The  redwood  seedlings  in  region  II  of 
Figure  1.6  provide  an  example  of  a  clustered  pattern. 

Clustering  may  arise  for  various  reasons.  In  a  botanical  context,  there  are  two 
well-recognized  causes  of  clustering  of  plants.  These  are  described  in  an  article  by 
E.  C.  Pielou: 

On  the  one  hand  seeds  may  fall  at  random  over  an  area  but  if  the  habitat  is  not 
homogeneous  the  proportion  germinating  and  thriving  will  vary  from  site  to  site  so 
that  the  density  is  high  in  some  sites  and  low  in  others.  On  the  other  hand  the 
habitat  may  be  homogeneous  but  the  individual  plants  may  occur  in  family  groups 
owing  to  the  fact  that  they  reproduce  vegetatively  or  by  seeds  with  small  radius  of 
dispersal. 

Pielou  calls  these  two  types  of  clustered  patterns  heterogeneity  of  habitat  and 
reproductive  clumping,  respectively. 

Clustered  patterns  also  arise  in  other  contexts,  such  as  human  geography  and 
animal  behaviour.  Towns  on  a  map  of  the  United  Kingdom  occur  in  clusters.  This 
is  because  towns  develop  in  suitable  locations  such  as  by  rivers,  and  not  in  very 
mountainous  country.  This  resembles  the  botanical  situation  of  heterogeneity  of 
habitat.  If  the  spatially  distributed  objects  are,  for  example,  herbivorous  animals 
such  as  cows  or  kangaroos  grazing  freely  over  a  large  area,  they  do  not  graze  as 
isolated  individuals  but  ‘socially’  in  a  number  of  groups.  Again,  if  the  objects  are 
retail  shops  of  all  kinds  in  the  suburban  regions  of  a  city,  these  will  tend  to  be 
clustered  together  in  various  main  and  subsidiary  shopping  centres.  In  these 
situations  there  is  association  between  the  individuals  (shops,  kangaroos,  or 
whatever)  belonging  to  the  same  cluster.  This  resembles  the  botanical  situation  of 
reproductive  clumping.  In  three  dimensions,  stars  do  not  occur  randomly  in  space 
but  in  clusters  or  galaxies. 

Heterogeneity  of  habitat:  models  for  patterns  with  clusters 

Non-homogeneous  Poisson  processes  modelling  events  in  time  were  described  in 
Subsection  44  of  Unit  2.  They  are  Poisson  processes  in  which  the  rate  of 
occurrence  o’f  events  is  a  known  function  of  time,  2(t).  This  sort  of  model  can  also 
apply  in  two  dimensions,  so  that  the  density  of  objects  is  a  function  of  position. 
This  would  be  appropriate  for  a  model  of  seeds  germinating  over  an  area  if  it  were 
possible  to  specify  how  the  fertility  of  the  land  varied;  for  example,  it  might 
increase  from  north  to  south  and  also  from  east  to  west.  Then  at  places  of  high 


Source:  E.  C.  Pielou,  ‘A  single 
mechanism  to  account  for  regular, 
random  and  aggregated 
populations’.  Journal  of  Ecology, 
vol.  48  (1960)  p.  575. 
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fertility,  the  value  of  the  density,  X,  is  high,  so  there  will  tend  to  be  more  objects 
occurring  in  the  Poisson  process,  and  this  will  lead  to  a  cluster  of  points.  The 
theory  of  the  non-homogeneous  Poisson  process  can  be  developed  in  two 
dimensions  in  a  similar  way  to  one  dimension,  as  was  done  in  Unit  2,  but  it  will 
not  be  studied  in  this  course. 

Another  model,  described  briefly  in  Subsection  £4-  of  Unit  2,  is  the  doubly- 
stochastic  Poisson  process,  where  the  rate,  A,  is  a  random  variable.  This  can  also 
be  extended  to  the  two-dimensional  Poisson  process.  This  model  is  also  used  to 
describe  patterns  with  clusters,  but  will  not  be  discussed  further  in  this  course. 

Reproductive  clumping:  models  for  patterns  with  clusters 

The  models  that  are  described  below  are  two-dimensional  analogues  of  the  model 
described  i^-Sirbseetion-3. 1  for  a  compound  Poisson  process,  in  which 

compound  events  occur  according  to  a  Poisson  process,  and  multiple  occurrences 
of  some  other  event  are  associated  with  each  compound  event.  The  analogy  is  even 
closer  with  the  model  for  traffic  flow  described  in  Subsection  3.3  of  Unit  2;  there  a 
queue  of  cars  followed  each  slow-moving  vehicle,  whereas  here  objects  are 
clustered  round  some  point. 

Consider  a  pattern  composed  of  clusters  of  objects  which  are  in  some  way 
dependent  on  each  other.  Each  cluster  may  be  a  group  of  cows  grazing  together  or 
a  number  of  plants  which  have  reproduced  vegetatively  from  a  parent  plant.  In 
each  cluster  there  is  assumed  to  be  a  cluster  centre  which,  in  this  model,  is  not 
itself  the  position  of  an  object  but  only  a  geometrical  point.  This  pattern  is 
modelled  by  assuming  that  the  cluster  centres  are  distributed  according  to  a  two- 
dimensional  Poisson  process.  The  number  of  objects  located  round  each  cluster 
centre  is  a  random  variable,  T,  which  is  discrete  and  positive,  for  there  must  be  at 
least  one  object  in  each  cluster.  The  positions  of  the  objects  in  each  cluster  are 
usually  randomly  displaced  from  the  cluster  centre.  Both  the  distance  from  the 
cluster  centre  and  the  direction  are  random  variables.  This  general  model  is  called 
a  randomly-positioned  clusters  model;  it  includes  a  wide  range  of  particular  models 
with  different  specific  distributions  and  different  parameter  values,  as  follows. 

(i)  The  cluster  centres  are  distributed  according  to  a  Poisson  process,  with 
density  X. 

(ii)  The  number  of  objects  round  each  cluster  centre  is  a  realization  of  a  variate, 

Y,  whose  distribution  can  take  many  forms,  such  as  geometric,  or  Poisson  +  1. 
The  mean  of  this  distribution  can  be  varied  to  model  the  mean  observed 
cluster  size.  Another  possibility  is  that  Y  takes  a  fixed  value,  y.  For  example, 

y  =  2  might  roughly  model  the  pattern  of  ‘objects’  relaxing  in  a  large, 
uncrowded  park  on  a  warm  holiday  afternoon. 

(iii)  The  distribution  of  displacement  lengths,  R{>  0),  from  cluster  centre  to 
individual  objects  can  take  many  forms.  It  is  assumed  that  this  distribution  is 
the  same  for  each  object  and  that  all  distances  are  independent.  The  variate  R 
can  take  any  continuous  or  discrete  form  over  non-negative  values.  Common 
distributions  include  the  exponential,  and  also  the  Rayleigh,  which  will  be 
discussed  in  the  next  section.  If  the  mean  of  R  is  small  compared  to  the  mean 
distance  between  cluster  centres,  then  the  clusters  are  usually  obvious  and  well 
separated  from  each  other;  if  the  mean  of  R  is  as  large  as,  say,  30%  of  the 
mean  distance  between  centres,  then  the  clusters  will  overlap  and  not  be  very 
obvious  from  the  pattern. 

(iv)  The  displacement  direction  of  the  objects  from  the  cluster  centre  is  nearly 
always  taken  to  be  uniform,  U( 0, 2n). 


The  clusters  are  the  same  as  the 
'family  groups'  in  the  above 
quotation  from  E.  C.  Piclou. 


That  is,  Y  =  N  +  1  where  N  has 
Poisson  distribution. 


Figure  3.7  shows  a  simulated  randomly-positioned  cluster  pattern  in  which  the 
cluster  size  is  N  +  1,  where  N  has  a  Poisson  distribution.  (The  cluster  size  must  be 
at  least  one.)  The  variate  R  has  a  Rayleigh  distribution. 


Figure  3.7  A  realization  of  a  spatial  pattern  with  clustering 


Source:  Peter  J.  Diggle,  Julian  Besag 
and  J.  Timothy  Gleaves,  ‘Statistical 
analysis  of  spatial  point  patterns  by 
means  of  distance  methods’, 
Biometrics,  vol.  32  (1976)  p.  659. 


This  model  can  be  formulated  in  a  slightly  different  way  so  that  each  cluster  centre 
is  itself  the  position  of  an  object.  This  is  essentially  the  same  model  as  the  previous 
one;  the  way  in  which  it  is  formulated  reflects  situations  when  the  cluster  centre 
(head  office,  plant)  is  in  some  sense  a  parent,  and  the  cluster  (branch  offices,  plants 
formed  by  vegetative  reproduction)  comprises  offspring. 

In  the  last  two  sections,  many  models  for  spatial  patterns  which  are  random, 
regular  or  clustered  have  been  described.  In  the  next  section  we  shall  develop  some 
of  the  mathematical  properties  of  some  of  these  models,  and  see  what  information 
we  can  deduce  about  the  patterns. 


4  Count  and  distance  properties 


In  Unit  2  it  was  noted  that:  ‘There  are  usually  two  main  features  of  interest  in  a 
point  process:  the  number  of  events  that  occur  in  a  fixed  time  interval,  and  the 
time  that  elapses  between  consecutive  events  or  between  every  rth  event.’  If  we  are 
discussing  a  spatial  process  of  objects  along  a  line  as  in  Figure  1.1(a)  or  Figure  1.4, 
instead  of  a  process  of  events  in  time,  then  the  language  needs  adjusting.  The 
features  of  interest  now  are  the  number  or  count  of  objects  that  lie  in  a  given 
interval  along  the  line  and  the  distance  between  consecutive  objects  or  between 
every  rth  object.  Note,  however,  that  there  is  an  essential  difference  between  the 
patterns  of  events  in  time  and  those  in  a  one-dimensional  spatial  pattern.  In  time 
there  is  a  definite  ordering  so  that  one  event  follows  another;  the  ‘next’  event  is 
unambiguous — it  is  the  one  ‘to  the  right’.  In  space  there  are  two  objects  ‘next’  to 
each  object — one  on  each  side. 

Let  us  examine  these  alternative  features,  counts  and  distances  in  one  dimension, 
in  a  little  more  detail.  Along  the  line  surveyed  is  a  pattern  of  points  P1,P2,  ... 
which  are  the  positions  of  objects  as  illustrated  in  Figure  4.1(a).  As  a  reference 
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framework,  a  grid  of  points  0U02,  ...  can  be  marked  on  the  line:  see  Figure 
4.1(b).  This  grid  divides  the  line  up  into  equal  intervals  0102,0203,  ...,  labelled 
/i,/2, ....  We  can  study  the  pattern  either  in  terms  of  the  counts  of  objects  in  the 
grid  intervals  IUI2,  ...,  as  shown  in  Figure  4.1(c),  or  in  terms  of  the  distances 
between  consecutive  objects,  P1P2,P2P3,  ...,  as  shown  in  Figure  4.1(d). 

(a)  - • - . - .. - . - . _ . _ . _ . 

P*  P2  P>P  4  PS  Pb  P7  P8  P9 

(b)  - T - ^ - T - ^ - T - ^ - T - lA - T _ k _ T _ 

o2  o3  o4  o5  o6 

2  "  T  0  3  1  2 

(d)  p, - p2 

Pz - Pz 


Figure  4.1 

What  about  spatial  patterns  in  two  dimensions?  Do  the  above  ideas  about  counts 
and  distances  still  apply?  Consider  a  pattern  of  objects  PUP2,  ...  in  the  plane.  For 
illustrative  purposes  we  shall  focus  on  the  simulated  pattern  of  35  objects  in 
Figure  2.7.  In  the  one-dimensional  case  we  superposed  on  the  pattern  a  linear  grid 

which  divided  the  line  into  equal  intervals  IUI2, _ We  can  now  use  its  two- 

dimensional  equivalent,  a  square  grid  which  divides  the  surveyed  area  into  equal 
squares  or  quadrats.  Each  quadrat  has  the  same  area,  A  say. 


Figure  4.2  A  random  spatial  pattern  with  a  superposed  grid 


Table  4.1  Counts  of  objects 


This  procedure  provides  a  satisfactory  extension  from  counts  of  objects  in  an 
interval  to  counts  in  a  quadrat;  but  what  about  distances  between  consecutive 
objects?  Here  no  direct  extension  from  one  to  two  dimensions  is  possible.  This  is 
because  objects  can  be  ordered  consecutively  along  a  line,  but  not  over  a  region. 
Thus  in  Figure  4.1(a),  P2  is  consecutive  to  Pu  P1P2  is  the  distance  between  them, 
and  the  pattern  of  9  objects  gives  8  such  distances  PXP2,  ...,  P8P9  from  the  first 
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Figure  4.2  shows  the  spatial  pattern  of  Figure  2.7  with  a  grid  of  quadrats 
superposed  on  it.  The  counts  of  objects  in  the  quadrats  are  shown  in  Table  4.1. 
These  counts  can  now  be  used  to  gain  an  insight  into  the  nature  of  the  pattern.  In 
Subsection  4.1  we  consider  some  properties  of  counts  for  various  types  of  spatial 
pattern. 


object,  Plt  to  the  last  one,  Pg  (or,  possibly,  the  first  object,  P9,  to  the  last  one,  PJ. 
But  there  is  no  first  or  last  object  among  a  pattern  of  objects  in  a  plane. 

It  is,  however,  possible  to  define  two  types  of  distance  measurement  which  give 
information  about  the  nature  of  the  spatial  pattern.  These  are  the  object-to-nearest- 
object  distance,  S,  and  the  point-to-nearest-object  distance,  R.  As  an  illustration, 
consider  again  the  spatial  pattern  of  Figures  2.7  and  4.2;  this  pattern  is  shown 
again  in  Figure  4.3  and  the  objects  are  now  labelled. 


Figure  4.3 


Choose  one  of  the  objects  at  random,  using  say,  the  first  two  digits  on  Neave 
page  42,  i.e.  02,  which  indicates  the  object  P2.  The  object  nearest  to  P2  is  clearly 
P18.  The  realization  of  the  random  variable  S  for  P2  can  be  calculated  using  the 
coordinates  given  in  Table  2.1.  The  distance  between  P2  (4.60, 11.34)  and  Pls 
(5.86, 10.70)  is 

PzPis  =  Vt(4.60  -  5.86)2  +  (1 1.34  -  10.70)2] 

=  1.41. 

Since  the  pattern  in  Figure  4.3  contains  35  points,  it  could  provide  35  S-values. 
However,  it  would  be  unreliable  to  measure  the  distances  to  nearest  objects  from 
objects  close  to  the  boundary;  there  could  well  be,  for  example,  an  object  outside 
the  boundary  closer  to  P16  than  is  P6. 


Since  the  original  pattern  of  Figure 
4.3  was  generated  sequentially,  there 
is  actually  an  ordering  of  these 
objects.  This  is  ignored  in 
the  calculation  of  S-distances  and 
^-distances. 


Question  4.1  In  Figure  4.3  the  object  nearest  to  P2  is  P18.  This  is  indicated  on  the 
diagram  by  the  arrow  on  the  line  joining  P2  t oP18.  What  is  the  object  nearest  to 
P18?  What  is  the  S-value  for  P18?  □ 

Question  4.2  If  Pj  is  the  object  nearest  to  Pt  in  a  spatial  pattern,  is  P(  necessarily 
the  object  nearest  to  Pp  If  yes,  explain  why;  if  no,  give  a  counterexample  from 
Figure  4.3.  □ 


Returning  again  to  Figure  4.3,  a  realization  of  R,  the  point-to-nearest-object 
distance  is  obtained  by  choosing  at  random  any  point  in  the  area.  (By  ‘point’  is 
meant  a  geometrical  point.  With  probability  1,  this  will  not  be  the  position  of  any 
object.)  There  are,  of  course,  many  ways  of  doing  this.  By  continuing  sampling 
along  the  top  row  of  the  table  in  Neave,  the  next  digits  are  484881,  so  the 
coordinates  of  the  randomly  chosen  point,  C,  are  (0.484  x  20,0.881  x  20)  or 
(9.68, 17.62).  The  point  C  is  shown  in  Figure  4.3,  and  the  nearest  object  is  P33 
(12.48, 16.40).  So  the  observed  P-value  is  3.05. 

Different  types  of  spatial  pattern  give  rise  to  very  different  distributions  of  counts 
and  also  to  different  types  of  distribution  for  the  random  variables  S  and  R.  In 
Subsection  4.1  we  shall  study  some  properties  of  counts,  and  in  Subsection  4.2  we 
shall  study  the  properties  of  S  and  R,  for  various  spatial  patterns. 
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4.1  Counts  of  objects  and  their  properties 


Suppose  a  region  of  area  A  is  marked  out  in  the  plane  inhabited  by  objects  in  a 
spatial  pattern;  this  region  could  be  a  square  quadrat  or  have  some  other  shape. 
Denote  by  X  the  number  of  objects  in  the  pattern  falling  in  the  area  A.  In  this 
subsection  some  properties  of  the  distribution  of  X  will  be  derived  for  various 
types  of  spatial  pattern. 

Random  patterns 

For  a  two-dimensional  Poisson  process  with  density  X,  it  has  already  been  noted, 
in  Subsection  2.2,  that  X,  the  number  of  objects  in  regions  of  area  A,  has  a 
Poisson  distribution  with  parameter  XA.  So  the  mean  and  variance  of  X  are  given 
by 

E(X)  =  XA,  V(X)  =  XA. 

Just  as  for  events  in  time,  the  equality  of  the  mean  and  variance  for  this  process 
provides  a  useful  criterion  for  comparison  of  spatial  patterns.  The  ratio  of  the 
variance  of  the  random  variable  X  to  its  mean  is  known,  as  for  events  in  time,  as 

the  index  of  dispersion  /: 

r  =  Vi*) 

E(xy 

For  the  Poisson  process,  /  =  1.  If  /  <  1,  then  there  is  less  dispersion  than  for  a 
Poisson  process,  so  we  should  expect  more  regularity  in  the  pattern;  /  >  1  implies 
more  dispersion  or  scattering  of  the  points,  which  would  suggest  some  clustering. 
The  quantity  /  may  be  independent  of  the  value  of  A,  but  in  some  patterns  it  will 
be  a  function  of  the  size  of  area  under  study.  This  feature  was  also  true  of  events 
in  time  where  in  some  cases  I(t)  was  independent  of  t  and  for  other  processes  it 
was  an  increasing  function  of  t. 

Patterns  with  regularity 

The  extreme  case  of  a  pattern  with  regularity  is  a  completely  regular  pattern,  when 
the  objects  are  positioned  in  a  lattice  (triangular,  square  or  hexagonal).  Since  there 
is  no  random  element  in  their  specification,  and  repeated  realizations  of  the 
process  produce  exactly  the  same  positions,  the  count  X  of  objects  in  any  quadrat 
is  a  fixed  quantity,  and  its  variance  V(X)  is  zero.  Hence  the  index  of  dispersion  is 
zero. 

For  a  lattice  pattern  with  independent  random  displacements,  the  index  is  also  less 
than  one,  as  can  be  seen  using  a  similar  argument  to  that  in  Unit  2,  Section  5,  for 
a  process  of  regular  arrivals  with  perturbations.  Suppose  that  the  pattern  is  spread 
over  a  large  area  A  with  a  density  of  X  objects  per  unit  area.  Each  object  should 
appear  at  a  lattice  point  within  A  but  is,  in  fact,  slightly  displaced  from  that  point. 
Then  E{X),  the  mean  number  of  objects  within  A,  is  XA.  The  only  objects 
associated  with  lattice  points  of  A  that  may  not  lie  within  A  are  those  near  the 
boundary,  and  the  only  extra  objects  within  A  that  are  not  expected  to  be  there 
are  those  associated  with  lattice  points  just  outside  the  boundary.  Objects 
associated  with  interior  lattice  points  will  certainly  lie  within  A.  This  is  illustrated 
in  Figure  4.4.  Accordingly,  the  variance,  V(X),  of  the  number  of  objects  in  A  will 
be  small,  certainly  small  compared  to  E{X),  and  the  ratio  V(X)/E(X)  will  decrease 
as  the  area  A  increases. 

Another  type  of  pattern  with  regularity  is  illustrated  in  the  next  question. 

Question  4.3  Suppose  that  a  spatial  pattern  consists  of  a  two-dimensional 
Bernoulli  process  where  the  probability  that  an  object  is  placed  at  any  lattice  point 
is  p.  Calculate  the  index  of  dispersion  using  rectangular  quadrats  of  size  a  x  b.  □ 

As  you  have  just  found,  the  index  of  dispersion  for  a  Bernoulli  process  is  less 
than  1.  As  p  increases,  the  index  decreases  towards  zero.  This  is  in  accord  with  our 
ideas  about  dispersion,  for  if  p  is  close  to  1,  there  will  be  an  object  at  most  of  the 
lattice  points  and  so  the  pattern  will  show  regularity  and  very  little  dispersion. 


There  might  be  some  slight  variation 
in  the  counts  if,  for  instance,  a  grid 
of  square  quadrats  was  placed  over 
a  triangular  lattice.  But  there  is  zero 
dispersion:  the  estimation  procedure 
is  a  little  inaccurate. 


Figure  4.4 
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Patterns  with  clustering 

The  index  of  dispersion  for  the  model  developed  in  Subsection  3.2  for  a  pattern 
showing  reproductive  clumping  can  be  calculated  using  the  formulas  that  were 
introduced  in  Unit  2,  Subsection#,2.  The  argument  is  exactly  the  same  as  that 
used  in  Subsections  3-T^and  ji^of  Unit  2 ,  so  they  will  be  given  more  briefly  here. 

"2- 

Consider  a  pattern  of  randomly-positioned  clusters  with  a  two-dimensional 
-Poisson  process  of  cluster  centres  with  density  JL,  and  with  cluster  sizes  Kl,K2,  ..., 
which  are  independent  variates  with  mean  n  and  variance  a2.  There  is  no  object  at 
the  cluster  centre;  Kh  the  number  of  objects  in  the  ith  cluster,  is  greater  than  or 
equal  to  one.  We  shall  calculate  the  mean  and  variance  of  X,  the  count  of  objects 
in  a  large  area  A.  For  simplicity,  assume  that  any  cluster  falls  either  entirely  inside 
A  or  entirely  outside  it.  While  this  is,  admittedly,  an  approximation,  it  simplifies 
the  calculations  enormously  and  the  associated  error  is  small  when  A  is  large.  Let 
the  number  of  cluster  centres  falling  within  A  be  a  random  variable  Y  which  has  a 
Poisson  distribution  with  parameter  XA.  So  the  total  number  of  objects  within  A  is 
given  by 

X  =  Ki  +  ...  +  Ky.  .  | 

The  expected  value  of  X  can  be  calculated  using  Formula  of  Unit  2: 

E(X)  =  Ey[E(X\Y)1 

Question  4.4 

(i)  If  y  clusters  are  observed  within  A,  find  the  conditional  expectation 
E(X\Y=y). 

(ii)  Find  the  expected  value  of  X.  □ 


To  find  the  variance  of  X,  we  would  first  calculate  the  expected  value  of  X2,  using 
Fofmi*k-43T6)-0fT^7iTr2^  &  j?  $  6-5bi) 

E(X2)  =  Er[E(X2lY)], 


and  then  find  the  variance  using  V(X)  =  E(X2)  -  [£(2Q]2.  This  is  what  is  required 
in  the  following  question. 


Question  4.5 

(i)  Calculate  the  conditional  expectation  E[X2  \  Y  =  y]. 

(ii)  Hence  calculate  the  variance  of  X. 

(Hint:  This  question  is  very  similar  to  Question  3.1  of  Unit  2)  □ 


So  for  the  randomly-positioned  clusters  model,  the  index  of  dispersion  is  given  by 
_  V(X)  _n2  +  a2 
E{X)  n 

a2 

=  /*  +  —• 

R 

Since  the  cluster  size  is  at  least  1,  K  >  1  and  hence  /j.  >  1.  So  it  follows  that  for 
this  clustered  pattern,  the  index  of  dispersion  is  always  at  least  1.  It  can  be  equal 
to  1  only  when  fx  =  1  and  a2  =  0,  in  which  case  the  clusters  are  all  of  size  one  and 
the  pattern  reduces  to  a  two-dimensional  Poisson  process.  The  larger  a2  is,  the 
larger  /  is;  if  there  is  a  lot  of  variation  between  cluster  sizes,  then  there  is  a  lot  of 
dispersion  in  the  total  number  of  objects  in  an  area. 


4.2  Object-to-object  and  point-to-object  distances 

We  now  return  to  the  two  distance  variates  defined  at  the  beginning  of  this 
section,  and  study  some  of  their  properties:  S,  the  distance  from  a  randomly- 
chosen  object  to  the  nearest  object;  and  R,  the  distance  from  a  randomly-chosen 
point  to  the  nearest  object.  Most  of  this  subsection  will  be  devoted  to  a  study  of 
the  distributions  of  R  and  S  in  a  random  pattern  or  two-dimensional  Poisson 
process;  the  subsection  ends  with  some  general  remarks  about  the  distributions  of 
R  and  S  for  clustered  and  regular  patterns. 
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Let  us  start  by  considering  the  one-dimensional  spatial  Poisson  process. 


Question  4.6  For  a  one-dimensional  Poisson  process  of  objects  in  space  with 
density  X  objects  per  unit  length,  find  the  distribution  of  the  random  variable  S 
and  state  its  mean.  Do  R  and  S  have  the  same  distribution?  □ 

So  for  the  one-dimensional  spatial  Poisson  process,  S  ~  M(2X)  and  R  ~  M( 2X). 

Moving  on  now  to  the  two-dimensional  Poisson  process  with  density  X,  consider 
the  distribution  of  the  distance  R  from  a  randomly-chosen  point,  C,  to  the  nearest 
object  P.  Draw  a  circle  with  centre  C  and  radius  r.  Then  the  event  that  no  object 
lies  inside  the  circle  is  equivalent  to  the  event  R  >  r,  as  can  be  seen  from 
Figure  4.5.  Denote  by  N  the  number  of  objects  in  the  random  pattern  that  lie 
inside  the  circle.  Now  N  has  a  Poisson  distribution  with  parameter  X  times  the 
area  of  the  circle,  i.e.  N  ~  Poisson(7dr2).  So  it  follows  that  the  c.d.f.  of  R  is  given 
by 

F(r)  =  P(R  <  r) 

=  1  -  P(R>r) 

=  1  -  P(N  =  0), 
so 

F(r)  =  1  —  e~nX'2.  (4.1) 

Differentiating  F(r)  gives /(r),  the  p.d.f.  of  R: 

f{r)  =  2nXre~nXrl.  (4.2) 

The  distribution  defined  by  the  p.d.f.  (4.2)  is  an  important  standard  distribution, 
the  Rayleigh  distribution.  It  involves  just  one  parameter,  X,  which  is  a  scale 
parameter.  A  sketch  of  the  p.d.f.  of  the  distribution  is  shown  in  Figure  4.6. 


Figure  4.6  The  p.d.f.  of  the  Rayleigh  distribution 


Question  4.7  Calculate  the  median  of  the  Rayleigh  distribution.  □ 


Since  R  >  0,  the  mean  of  the  Rayleigh  distribution  can  be  calculated  using  the 
formula 


E(R)  = 


(1  ~F(r))dr, 

*<0 


SO 

J*CO 

e~nXr2  dr. 
o 

This  integral  cannot  be  calculated  directly,  but  it  can  be  obtained  using  the  fact 
that  the  p.d.f.  of  the  normal  distribution  integrates  to  1 : 

f  —^=e~x2l2a2dx  =  1. 
j-  CO  (Ty/2n 

Since  the  graph  of  the  function  e~nXr2  is  symmetric  about  the  vertical  axis, 

(*  CO  f'co 

e~nXr2dr=\\  e~nXr2dr. 

Jo  J-co 


^Object  nearest  to  C 
falls  outside  circle 


Figure  4.5 


The  Rayleigh  distribution  is 
introduced  briefly  in  M245  Unit  5, 
page  33. 


Unit  1,  Equation  (^,1) 


This  kind  of  argument  was  used  to 
obtain  the  standard  result 


which  is  given  in  the  Handbook. 
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Now,  letting  nA  =  l/2a2  so  that  a  =  l/^2nA,  we  have 


E(R)  =  i 


e~r2/2a2  dr 


=  j 


0.5 

/T' 
A 


V 


So  the  mean  point-to-object  distance  is  just  slightly  greater  than  the  median 
distance,  which  was  found  to  be  0.470/,/A  in  Question  4.7. 


Question  4.8  Calculate  the  standard  deviation  of  R.  □ 

It  was  stated  above  that  A  is  a  scale  parameter  of  the  Rayleigh  distribution.  This  r 
demonstrated  in  the  following  question. 

Question  4.9  Find  a  value  of  k  such  that  the  distribution  of  Z  =  kR  is 
independent  of  A.  □ 


The  value  of  k  that  you  found  in  this  question  is  proportional  to  ,/A.  As  A  is  the 
density  of  objects  per  unit  area,  it  has  dimensions  (, area )_1  or  (lengthy2,  so  1  jjl 
has  the  dimensions  of  a  distance  (and  the  mean,  median  and  standard  deviation  of 
R  are  all  proportional  to  it). 

There  is  a  close  connection  between  Rayleigh  and  exponential  distributions,  as  can 
be  seen  from  the  following  question. 


Question  4.10 

(i)  If  R  has  a  Rayleigh  distribution  with  parameter  A,  find  the  distribution  of  R2. 

(ii)  If  R{,R2, ...,  Rn  are  independent  random  variables,  each  having  a  Rayleigh 
distribution  with  parameter  A,  write  down  the  distribution  of  £  Rf.  □ 

i=  1 


The  distribution  of  R  has  been  derived  for  the  two-dimensional  Poisson  process, 

and  properties  of  this  distribution  have  been  found.  By  Postulate  III  of  the 

process,  which  states  that  the  occurrences  of  objects  in  non-overlapping  areas  are 

independent  events,  it  can  be  deduced  that  R  and  S  (the  object-to-object  distance) 

have  the  same  distribution.  This  follows  because  if  an  object  is  chosen  at  random 

and  a  circle  drawn  with  this  object  as  centre,  then  the  distribution  of  the  number 

of  objects  other  than  the  chosen  one  inside  the  circle  does  not  depend  on  the  fact  ' 

that  there  is  an  object  at  the  centre.  Exactly  the  same  argument  applies  for  the 

distribution  of  5  as  was  used  for  the  distribution  of  R.  This  gives  the  following 

general  result. 


For  the  two-dimensional  Poisson  process  with  density  A,  the 
variates  R  and  5  both  have  a  Rayleigh  distribution  with 
p.d.f. 

f(x)  =  2nAxe~n2x2,  x  >  0. 


We  turn  now  to  clustered  and  regular  patterns. 

Firstly,  consider  a  clustered  pattern  such  as  the  one  invented  in  Figure  4.7.  For 
simplicity,  the  pattern  has  been  drawn  so  that  it  consists  of  non-overlapping 
clusters  of  objects  with  the  clusters  widely  separated,  so  that  much  of  the  surveyed 
area  consists  of  ‘empty  space’  between  the  clusters. 

An  5-distance  is  measured  from  a  randomly-chosen  object,  P  say,  to  its  nearest 
neighbouring  object,  Pl5  so  that  S  —  PPX.  Assuming  there  are  at  least  two  objects 
in  any  cluster,  Pl  will  be  in  the  same  cluster  as  P;  so  5-distances  will  tend  to  be 


Figure  4.7  A  clustered  pattern 
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small.  An  K-distance,  on  the  other  hand,  is  measured  from  a  randomly-chosen 
point,  C  say,  to  the  nearest  object,  labelled  P2,  so  that  R  =  CP2.  Most  points  C  fall 
in  the  empty  space  between  the  clusters,  giving  /^-values  which  will  tend  to  be 
large.  The  majority  of  K-values  will  exceed  the  majority  of  S-values;  so  R  will  be 
distributed  to  the  right  of  S,  in  the  sense  that  P(R  >  a)  >  P(S  >  a)  for  any  a  >  0. 

Also,  E(R)  >  E(S).  This  is  different  to  what  happens  in  the  case  of  a  two- 
dimensional  Poisson  process,  where  R  and  S  have  the  same  distribution. 

This  is  true  for  all  patterns  with  clustering,  even  if  they  do  not  consist  of  clearly 
separated  clusters  as  in  Figure  4.7.  In  general,  the  less  pronounced  the  clusters,  the 
less  difference  there  will  be  between  E(R)  and  E(S).  In  the  next  question,  a  specific 
case  of  the  randomly-positioned  clusters  model  is  described,  and  you  are  asked  to 
calculate  the  ratio  of  E(S )  to  E(R). 

Question  4.11  Points  are  distributed  over  a  large  area  in  a  spatial  Poisson 
process  with  density  A.  A  proportion  p  of  the  points  serve  as  cluster  centres  for 
clusters  of  size  2,  each  cluster  consisting  of  two  objects  separated  by  a  distance  d, 
with  mid-point  at  the  cluster  centre.  The  remaining  proportion  1  —  p  of  the  points 
serve  as  the  actual  locations  of  objects  (clusters  of  size  one).  If  d^f).  is  much  less 
than  1,  show  that  E(S)/E(R)  is  approximately  equal  to  (1  —  p)/{\  +  p)  <  1  for  this 
clustered  pattern  of  objects.  □ 

Secondly,  what  general  remarks  can  be  made  about  the  distributions  of  R  and  S 
for  patterns  with  regularity?  For  a  clustered  pattern,  ^-values  tend  to  exceed  S- 
values;  for  a  random  pattern,  R-  and  S-values  have  the  same  distribution.  This 
suggests  that,  for  a  regular  pattern,  P-values  will  tend  to  be  less  than  S-values;  and 
R  will  be  distributed  to  the  left  of  S,  in  the  sense  that  P(R  >  a)  <  P{S  >  a)  for  any 
a.  This  is  indeed  the  case. 

To  appreciate  that  this  is  so,  consider  a  completely  regular  square  lattice  pattern 

with  squares  of  side  a,  as  shown  in  Figure  4.8.  Because  the  pattern  is  completely  £> _ p  #  P; 

regular,  every  object-to-object  distance  is  equal  to  a:  P(S  =  a)  =  1.  On  the  other  a  .  .  S 

hand,  R  is  a  maximum  when  the  random  point,  C,  is  at  the  centre  of  a  square;  in  o 

this  case  R  =  aj 2/2  or  0.71a.  So  R  is  always  considerably  less  than  S. 

Although  the  difference  is  less  marked  when  the  pattern  is  not  completely  regular,  •  •  •  • 

you  can  see  that  there  is  always  a  tendency  for  R  to  be  less  than  S.  Whether  the  Figure  4.8 

model  is  a  lattice  pattern  with  independent  random  displacements,  or  an  inhibition 

process,  objects  do  not  occur  very  close  to  each  other  and  so  S-distances  cannot  be 

very  small.  On  the  other  hand,  a  random  point  may  be  anywhere  in  the  plane  so 

there  is  a  non-zero  probability  that  R  may  take  any  value,  however  small. 

The  results  can  be  summarized  as  follows. 


If  a  is  any  positive  quantity, 

(i)  for  a  clustered  pattern,  E(R)  >  E{S)  and  P{R  >  a)  >  P(S  >  a); 

(ii)  for  a  pattern  with  regularity,  E(R)  <  E(S)  and  P(R  >  a)  <  P(S  >  a). 


5  Detecting  departures  from  randomness 


In  the  first  four  sections  of  this  unit  we  have  studied  various  models  for  different 
types  of  spatial  pattern;  we  have  seen  how  the  models  are  constructed  and  in 
Section  4  we  studied  their  properties.  In  particular,  we  have  found  that  most 
spatial  patterns  fall  into  one  of  three  categories:  random,  regular  or  clustered.  If  we 
observe  some  particular  pattern,  it  is  reasonable  to  ask  to  which  category  of 
pattern  it  belongs.  Because  this  is  a  fundamental  question  about  a  spatial  pattern, 
we  shall  depart  from  the  usual  policy  of  this  course  to  introduce  some  statistical 
tests  of  hypotheses. 


The  concept  of  hypothesis  testing  is 
introduced  in  M245  Unit  7.  This 
section  uses  only  the  idea;  you  do 
not  need  to  know  details  from 
M245. 
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The  specific  aim  of  this  section  is  to  describe  two  tests  of  the  null  hypothesis  that  a 
particular  spatial  pattern  is  a  random  pattern  against  the  general  alternative 
hypothesis  that  the  pattern  is  not  random.  We  saw  in  Subsection  4.1  that  counts  in 
quadrats  have  different  properties  for  random,  regular  and  clustered  patterns,  and 
in  Subsection  4.2  we  noted  that  this  is  also  true  for  R-  and  S-distances.  So  in 
Subsection  5.1  we  study  a  hypothesis  test  that  uses  counts,  and  the  test  described 
in  Subsection  5.2  uses  R-  and  S-distances. 


5.1  A  test  based  on  counts 


Let  X  be  the  count  of  objects  in  a  spatial  pattern  falling  in  any  region  of  area  A. 
Then  X  is  a  random  variable:  the  index  of  dispersion  of  the  pattern  has  been 
defined  as  /  =  V(X)/E(X)  and,  as  we  saw  in  Subsection  4.1, 

/  >  1  for  a  clustered  pattern, 

I  =  1  for  a  random  pattern, 

/  <  1  for  a  regular  pattern. 

For  any  particular  pattern  we  can  observe  the  counts  x,,x2,  . . .,  xk  in  k  non¬ 
overlapping  quadrats  each  of  the  same  area  A,  which  together  cover  all  or  most  of 

k 

the  surveyed  region.  Denote  the  total  count,  £  xh  by  N. 


i=  1 


These  counts  are  observations  on  the  random  variables  Xl9X2,...,Xk  which  are 
the  numbers  of  objects  in  the  k  quadrats.  If  the  pattern  is  random,  then  the  Xt  are 
independent  and  each  X(  has  a  Poisson  distribution,  with  mean  XA.  The  sample 
mean  and  sample  variance  are  defined,  respectively,  by 


X  = 


Z*i 

f= i 


and 


Z  (Xt  -  X)'' 
s2  =  — 


k  -  1 

These  quantities  can  be  estimated  using 


x  = 


E  *i 

i=  l 

k 

N 

k 


(5.1) 


and 


2  *  -  1 
SZ  = - 


Z  (*;  -  x): 


(5-2) 

The  ratio  s2/x  provides  an  estimate  of  the  index  of  dispersion.  If  the  spatial  process 
is  a  two-dimensional  Poisson  process,  we  should  expect  the  value  of  s2/x  to  be 
reasonably  near  to  1,  i.e.  no  further  from  1  than  can  be  accounted  for  by  sampling 
variability  on  the  assumption  that  the  pattern  is  random.  If  s2/x  is  considerably 
greater  than  1,  then  clustering  is  indicated,  and  if  s2/x  is  much  less  than  1,  then 
regularity  is  indicated. 

To  test  the  null  hypothesis,  it  is  convenient  to  work  in  terms  of  the  statistic 
(k-  1)S2 


T  = 


X 


When  the  null  hypothesis  is  true,  and  so  the  pattern  is  random,  then  the  sampling 
distribution  of  T  is  approximately  chi-squared  with  parameter  (k  -  1),  provided 
that  N  and  N/k  are  both  sufficiently  large.  For  practical  purposes,  this  is  taken  as 
N  >  20,  N/k  >  4.  (5.3) 

That  is,  there  should  be  at  least  20  objects  in  the  pattern,  with  an  average  of  at 
least  four  per  quadrat. 
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The  observed  value  of  the  statistic  T  is 
t  ( k-\)s 2 

x 

k  X  (xt  -  x )2 

t  =  1 _ 

N 

using  Formulas  (5.1)  and  (5.2);  this  forms  the  basis  of  the  hypothesis  test  which  is 
often  called  by  statisticians  the  j2  dispersion  test.  Its  use  is  illustrated  in  the 
following  example. 


Example  5.1  The  x2  dispersion  test 


Figure  2.7  showed  a  simulated  random  pattern  with  35  objects  in  a  square.  We 
shall  carry  out  a  x 2  dispersion  test  to  investigate  whether  the  pattern  is  really 
random.  Since  N  =  35,  the  first  condition  of  (5.3)  is  satisfied.  For  the  second 
condition,  N/k  >  4,  to  be  satisfied,  we  must  take  k  <  8.75.  In  Figure  4.2,  the 
pattern  was  divided  into  four  quadrats  and  k  =  4  satisfies  the  second  condition. 


The  observed  values  of  Xt  are  9,7, 10,9.  Hence 

X(x,--x)2  =  5]x?-(Xx,-)2/4 

=  311  -  352/4  =4.75. 

The  observed  value  of  T  is 
4  x  4.75 


35 


=  0.543. 


Since  this  unit  is  concerned  with  the 
properties  of  spatial  patterns,  no 
theoretical  justification  is  given  for 
the  use  of  the  y2  distribution.  If  you 
have  studied  M245  Unit  14  or  M345 
Unit  9,  you  may  recognize  T  as  the 
goodness-of-fit  statistic 
IlD?/El  =  'Z(Ol-Ed2/Ei. 


This  is  a  two-sided  test  as  we  wish  to  reject  the  null  hypothesis  of  randomness  if 
the  observed  value  of  T  is  either  very  large  or  very  small.  The  parameter  of  the  x2 
distribution  is  v  =  4  —  1  =3.  Taking  the  level  of  the  test  to  be  0.05,  critical  values 
for  t  can  be  found  from  the  table  on  page  21  of  Neave.  Reading  down  the  two 
columns  for  which  a2  =  5%  (=  0.05)  and  across  the  row  corresponding  to  v  =  3, 
the  critical  region  for  the  test  is  given  by  t  <  0.216  (small  values  of  t)  and  t  >  9.348 
(larger  values  of  t  than  would  be  expected  under  a  hypothesis  of  randomness).  The 
observed  value  of  0.543  does  not  fall  in  the  critical  region,  so  there  is  no  reason  to 
doubt  that  the  pattern  is  a  random  one.  (This  is  a  relief,  since  we  actually 
simulated  the  pattern  in  accord  with  a  spatially  random  model!)  □ 

Question  5.1  In  Figure  5.1,  the  pattern  of  211  white  oak  trees  from  Figure  1.3  is 
reproduced  and  divided  into  25  equal-sized  quadrats.  Count  the  trees  in  each 
quadrat.  Use  a  x2  dispersion  test  with  a  level  of  0.05  to  investigate  whether  the 
pattern  can  reasonably  be  regarded  as  random,  and  state  your  conclusion.  □ 


This  approach  to  hypothesis  testing 
is  the  one  used  in  M245.  If  you  have 
studied  M345 ,  you  will  be  more 
familiar  with  quoting  significance 
probabilities,  but  the  two 
approaches  are  essentially  the  same. 
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Figure  5.1 


The  choice  of  quadrats 

In  Question  5.1,  counts  were  obtained  by  dividing  the  total  area  into  25  equal 
square-shaped  quadrats.  Do  the  quadrats  have  to  be  square  in  shape?  How  should 
they  be  chosen?  What  if  the  surveyed  area  is  not  square  (when  division  into  square 
quadrats  is  straightforward)  or  rectangular,  but  irregular?  And  why  25?  How 
should  the  number  of  quadrats  be  chosen?  Here  are  some  answers. 

■Quadrats  do  not  have  to  be  square— it  does  not  matter  what  their  shape  is.  This 
is  because  the  count  X  of  objects  in  a  random  pattern  of  density  A  falling  in  a 
region  of  area  A  has  the  Poisson(/U)  distribution  whatever  the  shape  of  the  region. 

Example  5.2 

Figure  5.2  shows  the  numbers  of  the  large  woodlouse  Philoscia  muscorum  collected 
in  37  contiguous  hexagonal  quadrats  of  beech  litter  at  Wythan  Woods  near 
Oxford,  on  30  October  1958.  The  quadrats  are  1  ft  across,  so  each  has  an  area  of 
0.866  ft2.  In  this  data  set  we  do  not  have  the  actual  spatial  pattern  of  objects;  the 
individual  positions  of  the  53  woodlice  were  never  recorded,  only  their  counts  in 
the  various  quadrats.  So  R-  and  S-distances  are  not  available  in  this  case. 

For  this  example,  N  =  53,  k  =  37,  so  the  average  number  of  objects  per  quadrat  is 
N/k  <  4  and  it  is  not  appropriate  to  apply  the  %2  dispersion  test.  It  would  be 
possible  to  apply  it  if  the  quadrats  were  combined  into  larger  areas.  □ 

There  is  another  important  feature  of  this  data  set.  In  Figure  5.1,  the  pattern  of 
white  oaks  was  divided  into  25  quadrats  simply  by  drawing  lines  on  a  diagram  of 
the  pattern.  The  quadrats  are  a  mere  geometrical  construction,  and  we  could  have 
drawn  a  different  set  of  lines  on  the  diagram  and  divided  the  pattern  into  a 
different  array  of  quadrats.  But  in  Figure  5.2  the  hexagonal  quadrats  have  a  real 
existence  as  part  of  the  zoological  experiment;  they  are  actual  areas  of  beech  litter 
marked  out  physically  by  the  zoologist.  So  there  are  cases  in  which  the  statistician 
is  given  a  pattern  of  object  positions  and  can  mark  out  quadrats,  freely  chosen  for 
shape,  size  and  position,  on  a  map  of  the  pattern;  and  there  are  cases  where  the 
quadrats  are  physical  entities  presented  to  the  statistician  as  part  of  the  data.  In 
the  latter  situation  it  may  well  be  that  only  the  counts  of  objects  in  the  quadrats 
have  been  recorded,  and  not  their  individual  positions. 

Question  5.2  Figure  5.3  shows  the  numbers  of  the  small  woodlouse  Trichoniscus 
pusillus  provisorius  taken  in  24  very  small  contiguous  quadrats  of  beech  litter.  All 
quadrats  have  the  same  area  (0.124  ft2)  but  not  the  same  shape.  Is  a  x2  dispersion 
test  appropriate  for  these  data?  If  so,  perform  the  test  using  a  level  of  0.01.  □ 

In  Question  5.1,  on  the  pattern  of  white  oak  trees,  the  number  and  pattern  of 
quadrats  was  open  to  choice,  since  we  were  free  to  mark  them  out  on  Figure  5.1 
as  we  pleased.  There  are  21 1  trees,  so  N  =  21 1,  amply  meeting  the  first  of  the 
conditions  given  in  (5.3),  and  k  can  then  be  chosen  to  have  any  convenient  value 
up  to  about  55.  The  square  area  of  survey  divides  naturally  into  square  quadrats, 
so  25  is  a  suitable  number,  giving  a  substantial  number  of  degrees  of  freedom  (24) 
for  the  x2  test.  To  have  used,  say,  49  smaller  quadrats  would  have  involved 
substantial  extra  work  in  marking  out  the  diagram  and  taking  counts,  without 
much  advantage;  while  to  have  used  a  coarser  division  into,  say,  16  or  9  square 
quadrats  would  have  meant  substantial  loss  of  information.  On  the  other  hand, 
suppose  we  had  divided  the  area  of  survey  into  16  equal  squares  and  then  drawn 
the  SW/NE  diagonal  of  each  square;  this  would  have  given  32  triangular  quadrats, 
a  perfectly  good  procedure,  broadly  equivalent  to  using  25  or  36  square  quadrats. 

If  the  surveyed  area  is  not  a  square,  then  the  pattern  of  quadrats  can  be  chosen  in 
many  ways.  It  is  important  that  the  quadrats  cover  as  much  as  possible  of  the  area 
so  that  as  little  as  possible  of  the  information  is  lost. 

Question  5.3  Look  back  at  the  pattern  of  72  seedlings  in  region  I  of  Figure  1.6. 
Suppose  that  you  are  asked  to  test  the  pattern  for  randomness  by  means  of  a  x2 
dispersion  test  on  quadrat  counts.  How  would  you  construct  the  quadrats?  □ 


Source:  M.  Lloyd,  ‘Mean  crowding’, 
Journal  of  Animal  Ecology,  vol.  36 
(1967)  p.  1. 
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Figure  5.2  Numbers  of  the  large 
woodlouse  collected  in  hexagonal 
quadrats 


Source:  Lloyd,  ‘Mean  crowding’. 
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Figure  5.3 
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Question  5.4  For  the  pattern  of  post  offices  in  Figure  1.5,  do  you  think  it  would 
be  feasible  to  test  for  randomness  using  quadrat  counts?  If  so,  say  briefly  and 
without  going  into  detail  how  you  would  set  about  doing  it.  What  disadvantages 
do  you  think  would  be  involved?  □ 


5.2  A  test  based  on  distances 


In  Section  4,  the  variate  S  was  defined  to  be  the  distance  from  a  randomly  chosen 
object  to  the  nearest  object,  and  the  variate  R  was  defined  as  the  distance  from  a 
randomly  chosen  point  to  the  nearest  object.  In  Subsection  4.2  we  showed  that: 

(a)  R  and  S  have  the  same  distribution  for  a  random  pattern; 

(b)  /^-distances  tend  to  exceed  S-distances  for  a  clustered  pattern; 

(c)  S-distances  tend  to  exceed  F-distances  for  a  regular  pattern. 

So  it  should  be  possible  to  use  samples  of  ^-distances  and  S-distances  for  detecting 
departures  from  randomness  in  the  pattern.  There  are  at  least  six  different  methods 
for  doing  this,  and  several  further  ones  involving  the  distance  from  a  randomly 
chosen  point  to  the  second  nearest  object.  In  this  unit  we  shall  discuss  just  one  of 
the  methods;  it  dates  from  1954  and  is  due  to  B.  Hopkins. 

Hopkins’  test 

Suppose  we  are  given  a  spatial  pattern  with  N  objects  in  the  surveyed  area,  and 
we  wish  to  test  the  null  hypothesis  stated  in  the  introduction  to  this  section  that 
the  pattern  is  part  of  a  two-dimensional  Poisson  process.  The  procedure  for 
Hopkins’  test  is  as  follows.  Choose  a  sample  size  n.  Then  select  n  points  at  random 
in  the  surveyed  area  and  measure  the  distances  rx,r2,  . . .,  r„  from  each  point  to  the 
object  nearest  to  it;  also  select  n  objects  at  random  from  the  N  in  the  pattern  and 
measure  the  distances  s1,s2,  ...,  s„  from  each  object  to  its  nearest  neighbour.  Then 
calculate  the  test  statistic 

n 

U  _  ■'=! 


i  -  1 

where  h  is  a  realization  of  the  random  variable  H  =  'ZR?/'Lsf-  If  the  pattern  is 
random,  we  should  expect  h  to  be  reasonably  near  to  1,  i.e.  no  further  from  1  than 
can  be  accounted  for  by  sampling  variability  on  the  assumption  of  a  random 
pattern.  If  h  is  considerably  larger  than  1,  then  clustering  is  indicated,  but  if  /i  is 
considerably  less  than  1,  then  regularity  is  indicated. 


‘A  new  method  of  determining  the 
type  of  distribution  of  plant 
individuals’,  Annals  of  Botany, 
vol.  18  (1954)  pp.  21 3-26. 


In  Question  4.10  you  showed  that  for  a  random  pattern  £  Rf  has  a  gamma 

i  =  1 

distribution  with  parameters  n  and  nX;  since  S  and  R  have  the  same  distribution 

n 

for  a  random  pattern,  £  Sf  has  the  same  gamma  distribution.  These  distributions 

i=  1 

are  used  to  derive  the  distribution  of  H,  assuming  that  the  null  hypothesis  of 
randomness  is  true.  Its  distribution  is  known  as  the  F-distribution,  and  for  samples 
of  size  n,  its  two  parameters  (known  as  its  degrees  of  freedom)  are  vq  =  v2  =  2 n. 
This  distribution  is  tabulated  in  Neave,  pages  22-5. 

Example  5.3  Use  of  the  F-tables  for  Hopkins’  test 


The  F-distribution  is  introduced  and 
used  extensively  in  M345.  However, 
for  this  course,  you  are  not  assumed 
to  have  met  the  F-distribution 
previously. 


When  using  Hopkins’  test  we  wish  to  reject  the  null  hypothesis  if  h  is  either  much 
larger  or  much  smaller  than  1.  Accordingly,  the  alternative  hypothesis  is  always 
two-sided  and  the  critical  level  is  given  in  Neave,  as  a  percentage,  by  a2.  So  if  the 
chosen  level  of  the  test  is  0.05,  use  the  lower  table  on  page  23,  and  if  the  chosen 
level  is  0.01,  use  the  lower  table  on  page  24.  The  tables  give  only  the  upper  critical 
point  of  the  F-distribution;  the  lower  critical  point  is  obtained  by  calculating  the 
reciprocal  of  the  given  value. 

Suppose  n  =  5  and  we  have  chosen  a  =  0.05.  Then  vx  =  v2  =  10  and  the  test  of  the 
null  hypothesis  is:  reject  H0  if  h  >  3.717  or  if  h  <  1/3.717  =  0.269.  If  h  >  3.717,  this 
provides  evidence  for  clustering,  but  if  h  <  0.269,  this  is  evidence  in  favour  of 
regularity.  □ 
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Example  5.4  London  post  offices 

We  shall  use  Hopkins’  test  to  investigate  the  randomness  or  otherwise  of  the 
pattern  of  the  62  London  post  offices  shown  in  Figure  1.5.  For  this  example,  the 
sample  size  n  =  6  was  chosen  in  order  to  keep  the  calculations  fairly  short.  Points 
are  denoted  by  (x,  ^-coordinates  with  the  bottom  left-hand  corner  of  the  frame 
having  coordinates  (0,0).  The  scale  used  is  marked  in  Figure  5.4  and  is  such  that 
othe  top  right-hand  corner  has  coordinates  (24, 18);  the  objects  are  numbered  from 
01  to  62. 


Figure  5.4 

Using  a  procedure  similar  to  those  used  earlier  in  the  unit,  six  points  Cu  ...,  C6 
were  selected  at  random.  These  are  also  marked  on  Figure  5.4,  together  with  an 
arrow  indicating  the  object  nearest  to  each  point  Ct.  It  is  essential  that  all  points 
C,-  fall  within  the  area  of  the  survey;  so  if  a  point  were  selected  in  the  river  or  one 
of  the  parks,  it  would  be  discarded.  Table  5.1  shows  the  calculation  of  the  six 
distances  rt. 


Table  5.1  /^-distances  for  the  London  post  offices 


Point,  Cf 

Nearest  object,  Pf 

Distance,  r; 

c, 

(2.9, 4.3) 

Pos 

(4.3, 4.3) 

1.4 

C2 

(19.4,11.6) 

Psi 

(19.5, 12.0) 

0.4 

C3 

(16.0,16.1) 

P38 

(14.6,15.5) 

1.5 

c4 

(11.4,3.6) 

P32 

(12.3,3.4) 

0.9 

c5 

(18.8,10.2) 

P 4.9 

(18.3,11.3) 

1.2 

c« 

(20.3,13.2) 

P55 

(21.1,13.2) 

0.8 

Next,  six  objects  were  selected  at  random  from  the  62  possible  objects.  These  are 
listed  in  Table  5.2,  together  with  the  nearest  objects  and  the  distances  s(. 

Table  5.2  5-distances  for  the  London  post  offices 


Object  Nearest  object  Distance,  s, 


P48 

(18.2,15.6) 

Psx 

(18.6,17.6) 

2.0 

P25 

(10.9,8.9) 

P21 

(10.3,7.8) 

1.3 

P52 

(19.1,0.8) 

P44 

(16.8,0.5) 

2.3 

P08 

(4.3, 4.3) 

Po4 

(1.3, 2.9) 

3.3 

Pi  5 

(7.8, 2.7) 

Pi  9 

(8.9, 1.7) 

1.5 

P51 

(18.6,17.6) 

P48 

(18.2,15.6) 

2.0 

The  original  map  measures 
24  cm  x  18  cm. 


The  second  random  selection  gave  a 
point  in  Regent’s  Park  and  was 
rejected. 
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You  can  see  by  inspection  that  R-distances  do  tend  to  be  less  than  S-distances  for 
this  example.  (For  this  small  sample,  edge  effects  are  noticeable.  For  instance,  P08 
is  close  to  a  boundary  so  it  might  have  had  a  neighbour  closer  than  P04  if  the 
pattern  had  continued,  but  we  ignore  edge  effects  here.) 

We  next  proceed  to  calculate  h: 


Since  n  =  6,  the  parameters  of  the  /-'-distribution  are  v1  =  v2  =  12.  So,  using  a  level 
of  0.05,  the  critical  region  is  h  >  3.277  and  h  <  1/3.277  =  0.305.  Hence  we  reject 
the  null  hypothesis.  Since  h  is  less  than  1,  we  conclude  that  there  is  some  evidence 
for  regularity  in  the  pattern  of  post  offices.  □ 

Question  5.5  In  Example  1.6  it  was  suggested  that  the  patterns  of  redwood 
seedlings  in  the  two  regions  might  be  different.  Random  samples  of  five  points  and 
five  objects  were  selected  from  region  II  and  the  R-  and  S-distances  were 
measured.  The  five  observed  r-distances  are  12,9,2,3, 17,  and  the  s-distances  are 
2,6,3, 1, 1.  Carry  out  a  test  of  the  hypothesis  that  the  seedlings  in  region  II  are 
distributed  at  random,  using  a  level  of  0.01.  □ 

This  has  been  a  first  look  at  the  fascinating  topic  of  spatial  patterns  and  their 
properties.  We  have  studied  the  properties  of  the  Poisson  process  in  two 
dimensions  and  of  the  random  patterns  it  generates.  We  have  seen  how  there  are 
two  main  ways  in  which  a  pattern  may  depart  from  randomness:  by  displaying 
either  clustering  or  regularity.  The  broad  classification  regular/random/clustered 
has  served  us  well,  but  it  does  not  cover  all  patterns;  consider,  for  example,  a 
pattern  of  clusters  of  objects,  where  the  cluster  centres  are,  say,  at  the  vertices  of  a 
square  lattice!  This  pattern  combines  regularity  of  cluster  centres  with  local 
clustering  about  these  centres.  What  are  its  characteristics? 

Spatial  patterns  present  a  wide  field  of  problems,  many  of  which  have  not  been 
touched  upon.  For  example,  what  sort  of  pattern  results  from  the  superposition  of 
two  known  patterns?  How  can  the  relationship  between  two  patterns  be  described 
in  probabilistic  terms?  In  the  location  diagram  of  ants’  nests  of  two  species  shown 
in  Figure  1.7,  is  there  any  evidence  of  interdependence  between  the  positions  of  the 
two  types  of  nest,  e.g.  do  Cataglyphis  ants  tend  to  nest  near  Messor  ants,  or 
perhaps,  on  the  other  hand,  tend  to  avoid  their  nests? 

For  a  simple  sequential  inhibition  process,  what  is  the  average  number  of  objects 
which  can  be  placed  in  a  given  area  before  it  is  saturated,  and  what  is  the 
distribution  of  this  number?  How  might  a  spatial  pattern  of  trees  be  affected  by 
the  ages  of  the  individual  trees,  and  how  might  this  be  modelled? 

Not  all  of  these  questions  have  yet  been  answered  by  research  workers.  Spatial 
processes  are  a  popular  and  flourishing  area  of  current  research  in  probability  and 
statistics.  The  problem  addressed  in  the  next  unit  is  quite  different,  and  was  posed 
over  a  century  ago. 
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Objectives 


After  studying  this  unit  you  should  be  able  to: 
identify  the  main  features  of  a  spatial  pattern; 

define  a  two-dimensional  Bernoulli  process,  and  simulate  objects  in  such  a  process; 

explain  what  is  meant  by  a  two-dimensional  Poisson  process,  and  simulate  objects 
in  it; 

show  that  the  pattern  in  a  transect  from  a  two-dimensional  random  pattern  is  a 
one-dimensional  Poisson  process; 

give  practical  examples  of  patterns  with  regularity  and  with  clustering,  and 
describe  their  main  features; 

explain  what  is  meant  by  completely  regular  patterns,  lattice  patterns  with  random 
displacements,  and  simple  sequential  inhibition  processes,  and  simulate 
observations  from  these  patterns; 

describe  what  is  meant  by  a  randomly-positioned  clusters  model,  and  simulate 
objects  in  such  a  pattern; 

calculate  the  index  of  dispersion  for  simple  spatial  patterns,  and  explain  why  it 
takes  different  values  for  different  patterns; 

derive  the  p.d.f.  of  the  Rayleigh  distribution  for  R-  and  S-distances  in  a  Poisson 
process,  and  calculate  its  mean,  median  and  standard  deviation; 

explain  why  R-distances  tend  to  be  greater  than  S-distances  for  clustered  patterns 
and  less  than  S-distances  for  regular  patterns; 

use  the  * 2  dispersion  test  and  Hopkins’  test  to  test  the  hypothesis  that  a  spatial 
pattern  is  a  random  pattern,  and  report  the  conclusions  you  draw  from  the  test. 
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Appendix:  Solutions  to  questions 


Section  2 

2.1(i)  The  remaining  50  digits  in  row  1  include  four  9s,  so 
the  simulated  pattern  will  contain  four  objects. 

(ii)  The  incidence  of  9s  can  be  thought  of  as  a  Bernoulli 
process  with  p  =  0.1.  So  X  ~  5(50,0.1). 

2.2  Stage  1  The  simulated  value  of  u  is  0.9432.  From 
page  14  of  Neave,  for  the  Poisson  distribution  with 
parameter  5, 

P(X  <  8)  =  0.9319,  P(X  <  9)  =  0.9682. 

Hence  the  simulated  value  is  x  =  9,  and  there  will  be  nine 
objects  in  the  simulation. 

Stage  2  To  simulate  the  positions  of  the  nine  objects,  we 
need  nine  ratios  APJAB.  Reading  down  column  2  gives 
APJAB  =  0.217,  AP2/AB  =  0.618  etc.  Since  AB  =  50,  the 
lengths  AP{  are  given  by 

AP,  =  10.85,  AP4  =  14.1,  AP7  =  30.75, 

AP2  =  30.9,  AP5  =  39.6,  AP8  =  12.9, 

AP3  =  30.0,  AP6  =  40.55,  AP9  =  46.45. 

, _ ,  _ t 

A  Pi  P»  Pa  Pi  Pi  /%/’„  ^  B 

2.3(i)  The  number  per  square  metre  is  distributed  as 
Poisson(1.2).  The  shape  of  the  region  is  immaterial.  The 
probability  that  there  are  exactly  three  primroses  in  the 
square  is  e~  12(1.2)3/3!  =  0.0867. 

(ii)  0.0867 

(iii)  The  probability  that  the  region  (shape  unspecified)  is 
empty  is  e~12  =  0.3012. 

(iv)  In  two  square  metres,  the  number  is  distributed  as 
Poisson(2.4).  So 

P(at  least  five  primroses) 

=  1  —  P(four  or  fewer  primroses) 

=  1  -  0.0907  -  0.2177  -  0.2613  -  0.2090  -  0.1254 
(using  Neave,  page  14) 

=  0.0959. 

2.4  Draw  a  rectangle  which  includes  the  irregular  area 
completely.  The  sides  may  be  in  any  convenient  direction. 
Then,  using  the  method  already  described,  simulate  a  two- 
dimensional  Poisson  process  with  the  given  density  over  the 
entire  rectangle.  If  a  simulated  point  falls  within  the  irregular 
area,  it  forms  part  of  the  simulated  pattern;  if  it  is  outside 
the  area,  then  it  is  ignored. 


Section  3 

3.1  Since  the  probability  that  a  tree  dies  is  p,  the 
probability  that  a  tree  survives  is  1  —  p.  The  deaths  occur 
independently,  so  the  pattern  is  a  two-dimensional  Bernoulli 
process  with  parameter  1  —  p. 

3.2  The  model  of  random  deaths  assumes  that  death 
occurs  independently  from  tree  to  tree.  In  a  real-life  situation 
a  tree  might  well  be  more  likely  to  die  if  adjacent  to  a 
diseased  (and  subsequently  dead)  tree  than  if  surrounded  by 
healthy  trees;  i.e.  disease  could  occur  by  contagion.  So  the 
assumption  of  independence  could  well  be  unrealistic. 


3.3  The  random  uniform  variates  required  for  the  next 
three  points  are  0.573,  0.849,  0.467,  0.836,  0.354,  0.392.  So 
multiplying  by  20,  the  coordinates  of  P36,  PJ7  and  P38  are 
(11.46, 16.98),  (9.34, 16.72)  and  (7.08,7.84)  respectively. 

The  distance  P33P36  is  1.17,  so  since  P33  is  retained  in  the 
inhibited  pattern  of  Figure  3.4,  the  point  P36  is  rejected. 
Similarly,  P22P31  =  2.84  so  P31  is  rejected.  The  point  P38  falls 
in  the  middle  of  an  empty  space,  and  becomes  the  next 
object  in  the  process. 


Section  4 

4.1  P2  is  the  object  nearest  to  P18.  The  S- value  for  P1S  is 
1.41,  the  same  as  for  P2. 

4.2  No,  this  is  not  necessary.  For  example,  P26  is  the 
object  nearest  to  P7,  but  the  object  nearest  to  P26  is  P19,  not 
P7.  The  S-value  for  P7  is  2.22  and  for  P26  it  is  1.53.  (There 
are  other  examples  in  Figure  4.3.) 

4.3  The  number  of  objects  in  this  spatial  pattern  has  a 
binomial  distribution,  i.e.  X  ~  B(ab,p).  Hence  E(X)  =  abp 
and  V(X)  =  abp(  1  —  p),  so  1  =  1  —  p. 

4.4(i)  If  it  is  observed  that  there  are  y  clusters,  so  that 
Y  =  y,  then 

X  =  Kt  +  ...  +  Ky 

and  hence  E(X  |  Y  =  y)  =  y  £(/£,)  =  yp. 

(ii)  E(X)  =  ErlE(X\Y)-] 

=  Ey(Yp) 

=  pE(Y) 

= 

since  Y  has  a  Poisson  distribution  with  parameter  AA. 
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4.5(i)  For  fixed  Y  =  y, 

E(X2\Y=y)  =  El(K1+...  +  Ky)2l 

=  E(iK2)  +  2YJE(KiKj). 

\i=  1  /  i<j 

Now  E{K f)  =  (T2  +  IX2,  and  £(KjK,)  =  /r2,  so  this  gives 
E(X2  |  Y=  y)  =  y(a2  +  p2)  +  y(y  -  1  )fx2 
=  ya2  +  y2fi2. 

(ii)  Since  Y  has  a  Poisson(L4)  distribution, 

E(Y)  =  XA,  E(Y2)  =  XA  +  X2A2, 
so 

E(X2)  =  Ey[E(X2  |  F)] 

=  Ey(Ya2  +  Y2fi2) 

=  XAa2  +  (XA  +  X2A2)h2. 

Hence 

V(X)  =  XAa2  +  XAp2 
=  XA(o2  +  n2). 

4.6  For  a  one-dimensional  Poisson  process  in  space,  the 
event  S  >  s  occurs  if  there  is  no  object  for  a  distance  s  on 
either  side  of  the  chosen  object.  So 

F(s)  =  1  -  P(S  >  s) 

=  1  ~(e~Xs)2 
=  1  -e~2Xs. 

Hence /(s)  =  2Xe~2Xs,  and  S  has  an  exponential  distribution 
with  parameter  2X.  The  mean  of  S  is  1/2A.  The  distribution  of 
R  is  the  same,  M( 2X). 

4.7  For  the  median,  m, 

F(m)  =  l 

Hence  e~nXm2  =  0.5,  so 
2  0.693 

m  =  - 7- 

TCX 

and  m  =  0.470/ y/1. 


4.8  E(R2)=  r2f(r)  dr 

Jo 


=  r2  x  2nXre  nX'2  dr 
Jo 

=  f  2nXr3e~nX'2  dr 


=  [-  r2e~*Xr  ]?  +  |  2re~*Xrl dr, 
integrating  by  parts. 


Hence  E(R2)  =  0  + 

J_ 

nX 


nX 


and  V(R)  =  E(R2)  -  [£(K)]2  =  i-  -  — 

nX  4a 

_  4  —  n 
4nX 

The  standard  deviation  of  R  is  0-26  /Ji. 


4.9  The  c.d.f.  of  Z  is 
F(z)  =  P(Z  <  z) 

=  P(kR  <  z) 

=  P(R  <  z/k) 

—  1  _  e-n/.z2/k2^ 

and  this  is  independent  of  /.  when  k  =  J~X.  In  this  case, 
f(z)  =  2nze~nz'. 


4.10(i)  Let  X  =  R2.  Then 
F(x)  =  P(X  <  x ) 

=  P(R2  <  x) 

=  P(R  <  sfx),  since  R  >  0, 

_  ]  _  e~nXx 

so  R2  has  the  exponential  distribution  with  parameter  nX. 

(«)  X  Rf  =  Z  xh  where  the  X(  are  independent  variates, 

i=l  i=l 

each  distributed  as  M(nX).  So  has  a  gamma 
distribution,  T(n,nX). 


4.11  Let  us  call  an  object  ‘paired’  or  ‘single’  according  as  it 
is  in  a  cluster  of  size  2  or  of  size  1. 


In  a  large  area  A,  one  can  expect  to  find  XA  cluster  centres, 
and  hence  pXA  clusters  of  size  2  and  (1  —  p)XA  clusters  of 
size  1.  So  one  expects  2 pXA  paired  objects  and  (1  -  p)XA 
single  objects.  Hence, 


P(an  object  is  paired)  = 


2p 

1  +  p 


P(an  object  is  single)  =  -j - -. 

1  +  p 

Distances  between  neighbouring  cluster  centres  follow  a 
Rayleigh  distribution  with  parameter  X  and  hence  have 
mean  0.5/^/a.  This  is  large  compared  with  d,  since  djl  is 
much  smaller  than  1,  so  an  R-  or  an  S-distance  measured  to 
a  paired  object  can  be  replaced  to  good  approximation  by 
the  corresponding  distance  measured  to  the  object’s  cluster 
centre. 

Accordingly,  R,  the  distance  from  a  randomly-chosen  point 
C  to  the  nearest  object,  can  be  taken  to  be  the  distance  from 
C  to  the  nearest  cluster  centre.  This  has  a  Rayleigh 
distribution  with  parameter  X  and  mean  0.  5/VI,  SO 
E(R)  ~  0.5/y/I  S  is  the  distance  from  a  randomly  chosen 
object  P  to  its  nearest  object.  If  P  is  a  paired  object,  S  will 
be  equal  to  d  in  nearly  all  cases.  If  P  is  a  single  object,  S  can 
be  taken  to  be  the  distance  from  P  (which  is  also  a  cluster 
centre)  to  the  nearest  cluster  centre.  This  has  the  same 
Rayleigh  distribution  as  above,  with  parameter  X  and  mean 
0.5/JX.  Hence 


£(S)--7L'*  + 
1  +P 

It  follows  that 


1  ~  P)  0-5 

x  +  p)j i 


E(S)/E(R)  *  -AB-2 dJX  +  j — - 

1  +  P  V  1  +  P 


I  +  P 
1  ~  P 
1  +P 

(where  «  means  ‘much  smaller  than’). 


since 


dJ~X  «  1 


Clearly  <  L  typical  of  a  clustered  pattern. 


36 


Section  5 


5.1  The  25  quadrat  counts  are  as  follows.  (You  may  have 
made  a  different  decision  about  allocating  borderline  cases  to 
quadrats.) 


15  8 

5 

9  10 

8  6 

4 

4  4 

10  8 

7 

7  10 

8  17 

4 

13  9 

9  15 

5 

3  13 

=  211, 

X*.2 

=  2133, 

X(x,  -x)2  =  352.16, 

25  x  352.16 
t  = - — - - =  41 


The  parameter  v  =  24,  so  for  a  level  of  0.05,  the  critical 
region  is  t  <  12.401  and  t  >  39.364.  The  observed  value  falls 
in  the  critical  region,  so  the  null  hypothesis  of  randomness  is 
rejected.  Since  t  is  larger  than  would  be  expected,  we 
conclude  that  there  is  evidence  of  clustering  of  the  white  oak 
trees. 


5.2  Although  the  quadrats  are  different  shapes,  they  all 
have  the  same  area,  so  the  x2  dispersion  test  can  be  applied. 
We  have 

Ac  =  24,  £  x,.  =  436,  £x?  =  11  008, 

£(xf-x)2  =  3087.33, 
t  =  169.94. 

When  v  =  23,  the  critical  region  with  a  level  of  0.01  is 
t  <  9.260  and  t  >  44.181.  The  observed  value  is  a  long  way 
inside  the  critical  region,  so  there  is  very  strong  evidence  that 
the  woodlice  do  cluster  together. 

5.3  To  use  square  quadrats  would  involve  waste  of 
information  along  the  diagonal.  The  area  of  survey  of  region 
I  is  a  right-angled  triangle,  so  it  can  conveniently  be  divided 
into  quadrats  of  the  same  shape.  Since  N  =  72,  we  must 
choose  k  <  N/4  =  18.  A  convenient  number  of  quadrats 

is  16. 

5.4  Yes.  The  effective  area  served  by  the  post  offices,  i.e. 
the  total  area  in  the  rectangular  frame  less  the  area  of  parks 
and  river,  needs  to  be  divided  into  quadrats  of  equal  area. 
These  can  be  of  irregular  shape  and  could  be  constructed  by 
counting  squares  on  graph  paper.  There  may  be  some  loss  of 
coverage  at  the  boundaries  of  the  parks  and  river,  but  this 
needs  to  be  kept  to  a  minimum.  The  procedure  is  rather 
messy  and  time-consuming,  but  nonetheless  practicable. 
However,  in  the  following  subsection,  a  test  is  introduced 
which  is  more  appropriate  for  this  type  of  situation. 

5.5  For  these  data, 


527 
“  51 
=  10.33. 


Since  a2  =  1%  and  vx  —  v2  =  2  x  5  =  10,  the  critical  region 
is  given  by  h  >  5.847  and  h  <  0.171  =  1/5.847.  There  is 
therefore  strong  evidence  that  the  pattern  is  not  random  but 
displays  clustering. 


t 


